AAAGATGTCA TTAATCAATT GAGAGAGAAA GGATATAAAG TATTTTTOGC AACAGGACGT 4800



TCGCATTCTG AAAATACATC AACTTG 


(2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4846 base pairs

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276:

GAATAAAAAG TAAAATTACT CGCCTTTGTT ACCTTTTACT TTATCAATaA AATCAGTTGC 60 

TTTTTCTTTT GCATTTTCAA CGAATTCTTT CGCTTTACCA GAAGCTTTAT CTTCTTTACC 120 

TTCGTTTTCT AAATTTTTAT TATCAGTAAC ATTACCTACT GTTTCTTTAA CATTACCTTT 180

TGCTTGTTCA AATTTACTTT CGTCTGCCAT AATAAATGCC TCCTCGGAAT AATTAAATGT 240 

TATATATAAT ACTTACCCAC TGAAAAATTA TCTAAACATT TTACTTAAAT AATTTTTGAT 300 

ATTGATTTGA CGTCATTTTA TAACTAGCGA AATAGATTCA TCATTAACTT GAGGGAGTGG 360 

GACTGAAATA ATAAAGAATC ACTAATGATT TATGATGTAT TAGTCACTAG CCATGTGAAA 420 

TTAAAAATAA GAATAAATGA GTAGCACGCA TGCATATAGG ATTTTACTTT ATCCGTAATA 480 

GCATCTCATT CCTAAATATC ATATAAATAC CTGTTTAAAT TAAAAAGCCC AGCAACATCA 540 

CGTTACTGAG CCATTAATAT GATTTATTTA GCAGGAATAA CTGCACCATT GTATTTTTCA 600 

TTAATGAAGT CTTGAATATC TTTAGATTGT AATACTTCAA TTAATGCTTT GATTTTCTTA 660 

TCATCTTGAT GTCCTTCTTT AACAGCAATT AAGTTTGCAT AAGGATTATC TTTCGCACTT 720 

TCTACAGCAA TAGAATCTTT TTTAGGATTT AGTTTTTGTT CGATTGCAAA GTTCGAATTA 780 

ATGATAACAG CATCAGCGTC TTCATTTTGA TAAATTTTAG GTAAGAATTC TGCTGATTGT 840 
TTATTATTAA ACTTAATATC TTTTTTATTC TCTGTAATAT CACTAAACTT AGCATCTTCA 900

ATTTTTACGC CTTTTTTGAT TTTAATTAAA CCTGCATCAA CGAAGAATTT TAAGAAACGT 960 

CCTTGTTCAG CTGGATTATT AGACACATAG ACTGTTGCAC CTTTTGGTAA TTCTTTTAAA 1020 

CTTTTATACT TTTTAGAGTA TACAGCCATA GGTTCTAAGT GAACATCACC GGCACTTACG 1080 

ATTTTGTAAC CTTTATCCTT TTTCTCTGTG TTTAAATATG GTGTATGTTG GAAATAGTTT 1140 

GCGTCAATTT CACCTTTGTC TAGTAATTTA TTAGGTGTAG TGTAATCGTT AATTGTTTTA 1200 

ATATCTAGTT CATAACCTTT TTTCTCTAAT AATGGTTTTG CTTTTTCTAA AATTTCAGCA 1260

TTACCGCAAG CTGCTAATAC AACTGCAAAT GTTAATACTA AAATAAGACC AAATAATTTT 1380

TTCATAAAAT GAAACCCCCA ATTTATCGTT TATCAAGTTT ATTTGTAAGC CAATCCCCAA 1440

TGAATTGGAT TATAAATACA ATAATTAAAA TAAAAACTGT TGATACTAAA ATGACATCAT 1500

TTTGATTTCG AGTGAAACCT GTTAAGTATG CTAAATTTCC TAAACCACCG GCACCAATTA 1560

CACCTGCAAC TGCTGTTGAA CCAACTAAAG CGATTGCTGT AACTGTAATG CCAGACACTA 1620

GCGCTGGCAT AGCTTCAGGT AAAAGGACTT TACGAATTAC TGTCCAAGTA TTAGCGCCCA 1680

TTGACCAAGC CGCTTCGATG ACACCTTTAT CAATTTCTTT AAAAGCAATT TCTACOAGCC 1740

TTGCATAAAA CGGTGctGCG CCAATGATCA AGGCTGGTAA OGCACCTGTC GGACCACTTA 1800

TCGTTCCAAG TATCAAACTT GTAAATGGAA TTAATAATAA AATTAAAATA ATAAATGGTA 1860

TCGCTCTAAA TAAGTTAACA ATGAAAGAAA CGATAGAATA AAATAACCTT GCACCGATAG 1920

ACTTACCTTT AGCAGACAAG AATAATAACA CACCTAAAAT AAGACCAAGT ATAAATGCAA 1980

ATATAGTTGA GACGACTGTC ATGTATAGTG TTTCGACTAT TGCAGTCCAA ACTTCTGGCC 2040

ACTGAATATT AGGCATTGTA ATCATTTCAT TTATAATTTC ACTAAATGAT TTACCCATGT 2100

CTTAACACCT CCATTTTAAC TTGTCGCTCA ATTAACTCTT TTTCGAATTT TCCGAAATCT 2160

ACACTTGAAA TATATGGAAT ATGCAGAACT AAAAAGCCGA CTGTTCCATT TTTTGTATTT 2220

TTAATATTTG CTTCTAAAAT ATTAATTTTA ATATCATAGG CAGTTGATAG ACTCGATACA 2280

ATAGGCTCGG TTGTTGTTGA ACCAGCGAAA ACTAATCTAA CGATATATGC ATCTTTTTCT 2340

AATGOCTCTA ATTCTGTTAA AGATGTTTCG AAATCATCAT TTAAATCGTC TTTCACAAAT 2400

CGTTTTGTCA CAGTGTGTTG CGGATTTTCA AAAACCTGTG TCACCGGTCC TTGTTCTATC 2460

ACTTTACCAC TTTCCATAAC TGCAACTTCA TCACAAATAC GACGAATGAC ATGCATTTCA 2520

TGCGTAATTA GTACAATTGT TAAATTTTGT TGTTCTCTAA TTTTTAGTAG TAGATCTAAA 2580

ATTTfcATCTG TTGTTTGCGG ATCAAGTGCA CTTGTTGCCT CATCACAAAG CAAGACCGTT 2640

GGATCATTaG TAACGCTCGT GCAATCCCAA CACGTTGCTT TTGTCCACCT GATAACTCTG 2700

ATGGATAAGC CTTTTCTCTA CCTTTTAAAC CGACGAGTTC GACAAGTTCT AATGCTTTTT 2760

GCTTAGCTCT CCTTCTAGGG ACACCTGCAA TTTCAAGCGG AAACATAATA TTTTTTAACA 2820

CAGTCCTTGA CCATAACAAA TTAAAATGTT GGAAGATCAT ACTTACTTTT TGTCTTTTTG 2880

CTCTTAATCC ATTTTTGGAC AATTGACCTA TATGGTCTCC ATCTATAATA ACTTCACCTG 2940

ATGTAGGCGC TTCTAAATGA TTAAACATTC GAATCAAAGT ACTTTTTCCT GCTCCAGAAA 3000

AACCAATGAC GCCATAAATC GATCCTGCTC GAATCGATAA ATTAACGTGA TCTACAGCAA 3060
TTCCTCCCTC TGTTGCTTAA TAAAATAAAA AATGCTTTCT CAATATCGAT AGAAAAATTG 3180

AGAAAGCAAT AGTAGTATTG TTTCTCTCAT CTTCAAAAGT TAAAACTTTA TGTGAATTGG 3240

CACCATTTCT ATATAAGACG GTTGCCGGGC TTCGTAGGGC ACATCCCTCC ACCACTCTCG 3300

ATAAGAGTTT ACGCATCATT TAATTTGTAT TAATCCTAAC ACCTTAGTAA AATTTCGTCA 3360

ATAACTATTT TAAATTTTCT AACAAATCAG TCACCGATTT AAATGCATAA ATTCGTTTTA 3420

CTTCTTTATC TTTATTCATC AACAATAAAA TCGGCGTAGA CATGATTTGC ATATCTTTAC 3480

AAAACTGAGG ATAAAAGTTT AAATCTATTT TCAATAATGG TAACTGCAAT ATTTCATTAG 3540

CAATGTCTAA CATTCTTTCT GaAACCTTAC AAGTACCACA CGTTGGTGTA TAACCAAAGA 3600

TTAAATGTTT GTCTTCCTCA TAAAATGTAG TTACATCTTT GATGTCTAAT GAATTATTCA 3660

TTTACTAAAA CTAACCTTTC ATTATTTATA TTCGGTAAAA GAGGTGTTTC TTTCTTACAA 3720

GTAAAGCCAT GTTTTGAAAG TACATGCGCC AAATATTGTT TGGGGCAATT CGCAACTTGA 3780

CAGTAAGTTT TATCAATAAA TATATGTTCA CTTTCACTCA AATAACGTTT AAACCAATTT 3840

CTAATTCGAT CTCCTTCGTC ATCAGAATCG GCTAATACAA AAACTTGTTT ATCATACAGT 3900

GATTCTATCA TATCATCAAG CTTATCTATA CTCATTGTTC CATGAGTACA AATAATATTG 3960

ACTGGTTCTG CAATAACCTG TTGCACCCTT TTTTTATCAG ATTTTCCTTC AACAATTATC 4020

ACTTTATTTA CAATAGCCAT CATCATCACC CTTTAAAATC AATAAACATC TGTCACTGTA 4080

TCATTTCACA AAATTGGTAT GAATAAAACA TAAATCACAA AAAATTTAAA CTAGCTTAAT 4140

ATAATAATTA CAAACTCAAT GTTTGACTAG CTGGAACATT TAACATAAGC AGACAAAGGC 4200

TAAGTCAAAA ATCAACATCC TAAAATCTAC AATGTTATAT TAACAATAGT TAACCAAAAG 4260

AAAATACACC TATAACAAAC TTTTCAATTA TAGCGGGGCC CCAACACAGA AGCTGATGGT 4320

AAGTGAGCTT ACAATAATGT GCAAGTTGGC GGGGCCCCAA CATAAAGAAA TACTTTTTCT 4380

TTAGAAATTA GTATTTCTTA TGCATGAGTT TTACTCATGT ATTCCTATTT TTAAATACAC 4440

ATTAGCTGTG GCTTATGAAA ACAGGCTGGG ACATAAATCA ATGTTCTATG CTCTACGAAg 4500

TTATATTGGC AGTAGTTGAC TGAACGAAAA TGCGCTTGTA ACAAGCTTTT TTCAATTCTA 4560

GTCAGGGGCC CCAACACAGA GAATTTCGAA AAGAAATTCT ACAGGCAATG CAAGTTGGGG 4620

ATGGGCCCCA ACAAAGAGAA ATTGGATTCC CAATTTCTAC AGACAATGCA AGTTGGGGTG 4680

GGACGACGAA ATAAATTTTG CGAAAATATC ATTTATGTCC CACTCCCTAG ATTGATCTAT 4740

AGATACTACA CTTATTAAAG TAATATATTT TTATGATTCT CTTAGCTGCA ATCCCATGAA 4800

TACATGTAAT CATCAAACTT CATAGCCTCA AGGTCAGTAG ATTTCA 4846
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1843 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277:
AACAAAGACA CAATCGAACA TGAACCATCA GTAAAAGCTG AAGATATATC AAAAAAGGAG 60

GATACACCAA AAGAAGTAGC TGATGTTGCT GAAGTTCAGC CGAAATCGTC AGTCACTCAT 120

AACGCAGAGA CACCTAAGGT TAGAAAAGCT C^TCTGTTC ATGAAGGCTC TTTTGATATT 180

ACAAGAGATT CTAAAAATGT AGTTGAATCT ACCCCAATTA CAATTCAAGG TAAAGAACAT 240

TTTGAAGGTT ACGGAAGTGT TGATATACAA AAAAAACCAA CAGATTTAGG GGTATCAGAG 300

GTAACCAGGT TTAATGTTGG TAATGAAAGT AATGGTTTGA TAGGAGCTTT ACAATTAAAA 360

AATAAAATAG ATTTTAGTAA GGATTTCAAT TTTAAAGTTA GAGTGGCAAA TAACCATCAA 420

TCAAATACCA CAGGTGCTGA TGGTTGGGGG TTCTTATTTA GTAAAGGAAA TGCAGAAGAA 480

TATTTAACTA ATGGTGGAAT CCTTGGGGAT AAAGGTCTGG TAAATTCAGG CGGATTTAAA 540

ATTGATACTG GATACATTTA TACAAGTTCC ATGGACAAAA CTGAAAAGCA AGCTGGACAA 600

GGTTATAGAG GATACGGAGC TTTTGTGAAA AATGACAGTT CTGGTAATTC ACAAATGGTT 660

GGAGAAAATA TTGATAAATC AAAAACTAAT TTTTTAAACT ATGCGGACAA TTCAACTAAT 720

ACATCAGATG GAAAGTTTCA TGGGCAACGT TTAAATGATG TCATCTTAAC TTATGTTGCT 780

TCAACTGGTA AAATGAGAGC AGAATATGCT GGTAAAACTT GGGAGACTTC AATAACAGAT 840

TTAGGTTTAT CTAAAAATCA GGCATATAAT TTCTTAATTA CATCTAGTCA AAGATGGGGC 900

CTTAATCAAG GGATAAATGC AAATGGCTGG ATGAGAACTG ACTTGAAAGG TTCAGAGTTT 960

ACTTTTACAC CAGAAGCGCC AAAAACAATA ACAGAATTAG AAAAAAAAGT TGAAGAGATT 1020

CCATTCAAGA AAGAACGTAA ATTTAATCCG GATTTAGCAC CAGGGACAGA AAAAGTAACA 1080

AGAGAAGGAC AAAAAGGTGA GAAGACAATA ACGACrCCAA CACTAAAAAA TCCATTAACT 1140

GGAGwAATTA TTAGTAAAGG TGAAyCgAAA GAAGAAATCA CAAAAGATCC GATTAATGAA 1200

TTAACAGAAT ACGGACCAGA AACGATAACA CCAGGTCATC GAGACGAATT TGATCCGAAG 1260

TTACCAACAG GAGAGAAAGA GGAAGTTCCA GGTAAACCAG GAATTAAGAA TCCAGAAACA 1320

GGAGAyGTAG TTAGACCACC GGTCGATAGC GTAACAAAAT ATGGACCTGT AAAAGGAGAC 1380

TCGATTGTAG AAAAAGAAGA rATTCCATTC rAGAAAGAAC GTAAATTTAA TCCTGATTTA 1440

CCAACACTAA AAAATCCATT AACTGGAGAA ATTATTAGTA AAGGTGAATC GAAAGAAGAA 1560

ATCACAAAAG ATCCGATTAA TGAATTAACA GAATACGGAC CAGAAACGAT AACACCAGGT 1620 

CATCGAGACG AATTTGATCC GAAGTTACCA ACAGOAGAGA AAGAGGAAGT TCCAGGTAAA 1680 

CCAGGAATTa AGAATCCAGA AACAGGAGAT GTAGTTAGAC CACCGGTCGA TaGCGTAACA 1740 

AAATATGGgA CCTGTaAAAG GAGACTCgAT TgTaGGAAAA AGarGAATTc CaTTCaGGAA 1800 

AGAcGTaATT TaTCCTGTTT AGCACCCGGG GCAGAAAAGT TAC 1843 

(2) INFORMATION FOR SEQ ID NO: 278:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8536 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278:

TAAACAGCGC GTGTACTTGT GATTCCCCCT TCTTCTATTT TACCCACCCG GGAAATAATA 60 

CTTTTCGCGA TTCCTTACTT GaACAAGCAA TATTTTATCa GCTGTTTCTT CAACTAAACA 120

GACACATTTA ATCATCTTTG ACACCCCAAC TTTGTGAAAT CAATTTTTCA AATTATACTG 180 

TACAATTATG TTATCATATA TGAGTAGTTA TAGCGCAAAA CGTTAGCAAT TCAGCGCACC 240

CAACTTTTCA TATAAACAGA AGATACTAGG GGGAATTATT ATTATGGCAA AACgTTCCAA 300

ATCACAACGT TTATCAAGTT TACTAAATGT CGCAGGTTTC ATAGTCGACG GCTACAATGG 360 

CTATAAATAT CATGCTAAAA ATAAAAAATT AGTATATCTT TCATTAGGTT TAAGCACTGT 420

AGGAACCGTG TTAGACTTTT ACATTTCAAT TAAGTCACCA CGTAAGTTCA AAAAAGCAGT 480 

GGCAGTTGTT ACTTTAATAA CAAACGGTGC TAGATTATTT ACAAGCATTC GCAAAGTAAA 540

ACATGAATAC TAATTCAGAA AAGGATTGGT CGAACATAGA ACATGAAGTT CATTCGACCA 600

ATCCTTTTTC ATATACAAAA ATTCCTATTA CAACAATCAC GACTTTGATA GCCCCGCCAT 660 

AAAATTTAGG ATTCAATCCA ACTTTTCAGC TTGTGaAATG TAATAGGaAT TCATTATATA 720 

TTTATATACG TAAGACTTTA GTGAATATAT CTATAATTAT TTACTTGGTA AGCTGGTACC 780 

GTTCTGTAAG TTAAAATACC TGGTGCAGCT GAATAGTTCA TTTCTGAAAC TAAAATACTA 840

CCATCGTTAT TTACACGTTC TACAAACATA ACATGACCAT AGTAACCTAC ATCTGTTTGA 900 

GCGATAGAAC CTACAGTAGG TCTATTGTCG ATAGTGTAAC CATCTGCTGC CGCTGCGTTA 960 

TCCCAGTTAT TAGCATTCCA CCAATAAGTA CTAATACCTT TACCAATTTC AGCACGACGA 1020

GTATTCTAAC CTCTATTTGT CGTTGTTGCA GATCCTGAGT TCGTAGATGC aTTACCAGTT 1140

ACTTTCAATT TTTGACCCGG ATATATAAAG AAATTATTTA AGCCATTTAA GCTCATAATT 1200 

TTTTGATAAG TTGTACCATA TTTTGATGCA ATTAATGACA ATGAGTCACC TGCTTGTACT 1260 

GTATAGTATG ATCCGCCACC TGAGTTCGTT GATGGACGGC TACTATTGCT CGCAGCGTTA 1320 

CTTGAGCTAG CAGTACCTGA TACTTTTAAT TTTTGACCTG GATAAATAAA GAAATTATTT 1380 

AAACCATTAA GTCGCATAAT GTTTTGGTAA GTTGTACCAT ATTTTGATGC GATTAATGAT 1440 

AATGAGTCGC CTGCTTGTAC TGTGTAGTAT GATCCGCCAC CTGAGTTCGT TGATGGACGG 1500 

CTACTATTAC TCGTAGAATT ACTTGAGCCA GATACTTTTA GTACTTGGTT TGGGAAAATT 1560 

AGATTAGATG TTAAATTGTT TAATGACTTT AATTTAGCAA TCGAAATCCC ATACTTATTT 1620 

GAAATTGCCC ACACTGATTC ACCCGGTTTT ACTGTGTGAG TTGTAGCCGC ATTTGCTTGA 1680 

GTTGCCGCAA CAGCGCTAAT CGCGCTTGTC CCAATAATAG CTGCAATTAC TTTTTTTTGC 1740

ACTTTAAAAT CCTCCTCTTG CTTAACTTTC CTAACATTCT TTTATCCGAA TTTATGAATA 1800 

CTACATCATT ATACGATTTT ATTATGTATA ATAGGTTGAT GTTTGATGAC ATTATGaTTA 1860 

CAAAAAAATC ATATACTGTA TCATCAAATT TTATAATTAT CCCTTAAAAT TATTACAACT 1920

TATTAGATTT TACAATATCT AAATTATTAC AATTTCATAA TATTTCACTA TAAAATGATT 1980 

ACAATCCCTT TCTCTATTGG AAATAATTTT ATTCTCCAAC AATAACGCCC TACAAACATA 2040 

AGCATGAACT TTGCTTGTAG GACGCAATAT AATTTATTTT GCTAATGACG TTTCTATTGC 2100

CTTAATCTCA TCTTTAGATA AATTAACAGG TTTCTCTCCA TCTTTGACAT CTTCCTGCAA 2160 

CGCTTTTTGA GCTTCTTTTG AATGATACAA TTCTACGATT TTAGCATATG TTTTGTTATC 2220 

CAAGTCTTTG TCATTAACTG CAACAATATT AATATATGGC TTTACTGCAT CTGAATTTGA 2280 

TTTTTCTAAA AATATCGGAT CATTTTTAGG ATCTTTACCC GCTTTAGTTG CTACACCGTT 2340

ATTAATAACT GCAATATCGA CATCAGATAA AGCACGTGCA GTTTGTTGTG CATCTACTGC 2400 

AGTAATTTTT AAATGTTTTG GATTTGACGT TATATCTTTC ACCGTGCCTG CTAATCCGAA 2460 

ATCTTTTTTC AGTTTTATTA AACCAGCTGC TTCTAATAGT TTAAGTGCAC GTGCTTGGTT 2520 

TGACACATCA TTTGGAATGA CAACTTTAGC ACCATCTTTA ACCTTTTTGA CATCTTTAAT 2580

TTTATCTGAG TAAATGCCCA ACGGTGCTAA AACTGTTGTA CTTAATGCTG AAATCTTTGT 2640

TCCTTTATGC GCCTTTTTAT ATTGATCTAA AAATGCAAAA TGTTGGAATG CATTCATATC 2700

AATATCACCA TCATTTAATG CTTTATTCGG TAAATTGTAA TCTGAGAAGT GCTTAATCTC 2760 

CACATCAATA TCATCTTTTT TAGCTAATTC TTTAACCTTC TCCCAAGCCT TAGTGTCATT 2820

AAOTOCTACG ATAACTAACC CAATCAATCT TTTCATTCTA TCAATTCCTT TCAAAATCTT 2940

CACTATATAT CATTAATGTC TACGTATGAA TCTAGCTAGA ACATTCCCTA GGOTTTGAAT 3000

CACTTGGACA ATAATGACTA ATACAATAAC GGTAATAATA ATGACCGTCG TATCAAATCT 3060 

TTGATAACCA TACACTAAAG CTAAGTCTCC TATACCACCA CCGCCAACAG CTCCTGCCAT 3120 

CGCCGTACTT CCAATAAGTC CAATAATCGC AGTGGTAATT GCTAATACTA ACGAACCTAA 3180 

AGCTTCAGGA ATTAAAAAAT ATCTAATGAT TTGTAGTGGT GAAGCGCCCA TCGnTTTCGC 3240 

CGCTTCAATA ATCCCCTCGT CTACTTCCAA TAATGAGTTT TCAACAAGTC TTGCAATGTA 3300 

AGGTGCCACA TATACTGTTA AAGGCACGAT GGCAGCAGTC GTACCAATTG AAGTACCTAC 3360 

TACTAATTTT GTOAATGGCA CAATCGCAAT TAACAAAATA ATAAATGGTA GTGACCTTAA 3420 

AATATTGATT AAAGGATTTA AAACTTGATG TATCACTATA TTGGGCGATA TGCCTTGTTT 3480 

TCGAGTAATT ACCAATAAGA CACCTAATGG AATACCAATC ACTGCTCCTA AAAATAAAGC 3540 

AATAGATACC ATATATAGCG TTTCGTACAA TGCTTGTAAT AACTGTGCAC TGTCTAAATC 3600 

AGAACCAAAC ATATGTTAAT GcACCTCCTC AAATTGAATA TTTTTCTCTT TGAAATATTG 3660 

ATTTATTGCC GTGTCTTCAA ATTGTTGATC CATATTAAAT CGAAGCCACA TATAACATAC 3720

GGTGTTACCT TGTATTTCTG ACATAGATGA AAATAAAATT TTAACCTCTC TGCCACAAAT 3780 

TTGAATCAAG TCATTTATAA TCGGTTGTGT CACCTGAGTT TCCTCGACGA AGATTTTATA 3840

ATCTTTAAAA TCGCCAACTT GTTCGTCATT CAATCGACGA ATCAATGATG TACTTGGCTC 3900

AGTCTGTATA ACTGTAGACA CAAAATTTTG AGCAATCGTC GTTTTAGGAT GACTAAACAC 3960

CTCTTTAAcA GTTCCTGTTT CAACCACTTT CCCCTTTTCC ATTACAGCAA CACGATTACA 4020

AATGTCTTTA ATAACGCGCA TTTCATGTGT AATCATCATA ATTGTAATGC CAAAGGTTTG 4080 

ATTGACATTC TTTAATAACG TCAATATCGA AGCAGTCGTT GCTGGATCCA ATGCGCTTGT 4140 

TGCTTCATCG CATAGGAGTA TTTTCGGATT AGTAACAAGC GCTCTTGCAA TAGCCACCCT 4200

TTGCTTCTGC CCACCAGATA ATTCATCAGG AAATTGGTCT TTTTTATCAC TCAATCCTAC 4260 

AAATTCAAGC ATTTCCGTTA CTCGTTGCTT AATTTCTGTT TTGCTTTTCT TACTTAAAAT 4320 

GAGTGGCATT GCTACATTTT TAAATACGGT AGCTGAATTT AATAAATTGA AATGCTGAAA 4380

TATCATACCG ATATCTTTCT TAATATCCCT CATCATTTTA TCGCTATAAT TCGTAATATC 4440

ATGTCCATCT ACAATCACTT GTCCATTCGA GGCAGCTTCA AGATGATTCA CGAGTCTTAC 4500

CAACGTACTT TTTCCTGCAC CACTATATCC AATCACACCA AAAATATCAT TGCGATTGAC 4560 

CGTAAATGAT ACGTCCTTCA AAGCATCTAT TTTTTGCTTC TTTTTATTAA AGGTCTTACT 4620 
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TAAGAGCATT 


ATATGTAAAA 


TTGCATATAT 

X X WV>XftX.ftXxt X 


CGTcAATACA ATTTGCCGAA 


*I "I* PTfT AAAA 






AATTAAAAAA 


TAAGTAATTC 


ATGTQACAAT 


GACGAATTGT 


GAGACTACTA 


TfJfiPATTTAT 

A VJAwl A X X X 


a gnn 


5 


CAAATTAAAT 


CCATAAAAAT 


flTrrArraaT 


CCTCCACAAC 


GCAATTACTA 


AATATTAACA 


■vobU 




TCGCACAAAA 


AAGCACTAGC 


ATATTrAAfiA 
xixnx x **p/v*o**k 


ACAACAAACG 


TTGAACTCAA 


AATATATGCC 




10 


X vv a WW * 


x ini x inl/vi 


Ao Inlul ri\J X 


GCTTGTTTTA 


AATCATCGAC 


X AAAX w X X wA 


49BQ 






AAX iWi XAw-V 


AGTCCGTCTG 


TAATACCTTC 


TTTAGCTCGA 


5040 








A X\J X vj X (JA IT. 


AATGCAGGTA 


CTGAAATTAA 


ACTTTCCACT 


5100 


1S 


*otw*.Ul» lAAAv. 


TXTCAGCTAA 


xGTGTaATAC 


GATGTTGCTT 


TAATCAATTG 


TTTGGCACTT 


5X60 




"1A,"I1*"XATTTT 


TCACTTCAAA 


TGCAATCaCA 


CCTGTATGGC 


CATCCGCTTG 


AGCCATATGG 


5220 




ft ^^ft mm m m/i ft 

ACATCATGAT 


TTAAATGACT 


TTCAATACTT 


GGATGGAACA 


CTTGTTGCAC 


AGCTGGATGT 


52B0 


20 


GCTTGTAACA 


TTTTAATAAT 


TTCAATAACG 


CTGCGATTAA 


TTTGTTCCAT 


ACGTAAACCT 


5340 




AATGTTTTAA 


TACCCCTCAC 


AAGTAAATAG 


CTATCTTGAG 


GTCCTAAAAT 


OCCACCTGTT 


5400 




GAATTTGAAA 


TAAATGCTAA 


ACGTTCTGCA 


AGCTTGTCAT 


CCGATGTTGC 


AACTAAACCA 


5460 


2S 


GCAACGACAT 


CACTATGTCC 


ACCTAAATAT 


TTCGTTGCAG 


AATGTAAGAC 


AATATCGATA 


5520 




CCTAAATCTA 


ATGGATTCTG 


ATAATAAGGT 


GTCATAAATG 


TGTTATCAAC 


AACTGAAATC 


5580 


30 


AAACOalVlT 


CTTTCGCAAT 


TTCAGCAGAC 


TxTl'rAATGT 


CAGTAACACG 


TAATAATGGA 


5640 


TTAGAAGGTG 


TTTCAATAAA 


CAACATCTTT 


GTTGTTGGGC 


GTATCGCTTG 


TACAATTGAA 


5700 




TCTGTATGCG 


TTGTATCTAC 


AAAATCCACT 


TCAATGCCAA 


ATCGTGTAAA 


TACTTTTGTC 


5760 


3S 


AATG CGCG AT 


AAGTACCGCC 


yM- f««ft #**ft s**ft ww^'wn 

GTATACATCT 


GAATTTAAAA 


TAATATGATC 


TCCTTTGTCC 


5820 




M 7fc ft /■*» A*» ft rr» ft ft 

AACAG CAT AA 


ft ft «nV1^1« 

CAACTGCACT 


GATTGCTGCA 


ACACCTGAAC 


TAAATGCAAA 


GCCATGTTTG 


5880 




. CCA'lVl'lViA 


ATGTCGCAAT 


AACGCTTTCT 


ACAGAACTTC 


TTGTTGGATT 


CGCAGTACGA 


5940 


40 


uAAlAl lUAl 


ft rp/^i/ wiwiyi * 

A I Cvl XvACU 


X AAA XlJAt, vJA 


ATATCATCTT 


GTAAATATGT 


ACTTGTTTGA 


6000 




X AAA X XvjCy l\j 


X X\» XAAlJvjvjL, 


ACL- Xv-rX A1AA 


TCGTCTGTTG 


TGTGCCCACC 


AlVxAAX XAAx 


oUbU 






lul X VJAl XAX 




AXAAX XAAAX 




nUilAlAl wo 




4S 


ATCACTACCA 


TCTGGAAATA 


CGACAACAAT 


CGTACCTTCA 


GATAATTGCG 


CTTTTAAATT 


6180 




CAATGCACCT 


TGTAATGCTG 


CACCTGaAGA 


ACTGCCTACT 


AACAACCCTT 


CATTTATAGC 


6240 


SO 


CAAACTTTTG 


ACATTTCGAA 


AGGCATCTTG 


ATCTTTAATC 


GTAAATATCC 


CATCTACAAG 


6300 


ACGTCTCTCT 


AAAAATATCG 


GCCATTTCTC 


AGAACCX5ATA 


CCTTCAGTGT 


CATGTGCATG 


6360 




AGCTGGCCCT 


CCATTTAACA 


CGGACCCTTC 


TGGCTCAACG 


GCATAACATT 


GCACGTGATG 


6420 



ATAATCAATT TGCTGTAATG CTGAAGTCAA TTCGGGTCCC AATGTATGAA AATATGTATC 6540

CGGATTATGT TCGGATTCAA ATTGATTCAT ATAAACGGCA CCATATTTTT CAGCATAGGA 6600 

ACGTGCAGCT AATTGTGCCC CATGCATACC TTCAGACTGA CTCGTCCTTG AAACTTCTGC 6660 

ACCAAGCGCT ATCATAATAT TAATCTTTTC TTCTGAAAAA CCATACGGCG CAAAGATCTT 6720 

ACATTTCAAA TGATGTCTAT TCGCTGCAAT AGCTAACCCT ATGCCTGTAT TACCAGCAGT 6780

CGCTTCAACA ATAGTTTGAC CTGCACGCAC ACGCCCTTCT TGAATTGCCT TCTCTACTAA 6840 

ATATTTCCCG AGTCTGTCTT TAACACTGCC TCCAGGATTC CATTGTTCAA GCTTGGCATA 6900

AATTTTAACT TTATCATCAC TATAATGTTC TAACAGTACT AATGGTGTAT TGCCaATTAA 6960 

ATCATAAGTA ATCATAGATG CACCCTCATC TGACATGCCG ATCAAATGAA TGAAACCTTT 7020 

CTTCATGTCT CAATTTTAAT TCTTACTTTT CAGATAAGAA TTATAAACGA CATTTTGTTA 7080 

TTTTGCAATT ATCTAAGTTT CGATTAATTC AGAACCAGTA CTAAATTTTC AATTCCAAAC 7140

AAAAAAACAC CTGAGCAACA CAAATACTTG TGTGTCAGAT GCTTCTATAT ATTAACTAAA 7200

TAATTGCACG ATAAAGACTA AAATAATAAC GACAGGCATC GCATACTTAA TTAAGTAATA 7260 

CCAACCACTG AATAATCTAA ATCGATCTTT ACCAAAATAT TGTTGTAATA ATTTTTTATC 7320 

TAATAATTGT CCTACGACAA GCGTAGTACC TAATGCGCCT AATGGCATCA ATACATTCGA 7380

AACGATCAAA TCCATATTAT CAAAAATCGT TCCCGCACCG AATCTTACAT CTTTTAAGAT 7440 

ACCAAAAGAT AAGGTTGCTG GAATACTAAT GATAAATACT AAAATACTAC CGATCACTGC 7500 

GACTTTTTTA CGTTTTGTAT TGTCATTCTT CGTGAAGTTA GAAACATTTA ATTCTAATAA 7560

AGAAATAGAT GACGTTAAAG CCGCAAATAA GAACAGCACT AAGAATCCCA AATAGAATAA 7620 

TGTGCCTAGA TGCATTTGAC TAAAGACCAT TGGCAGTACT TTAAATAATA ATCCAGGCCC 7680 

TTC15X3TGGT TCATAGCCAA AACTATGTAA AGCCGGAAAT ATAGCTAGAC CTGCCAATAC 7740

AGATACAAAG ATATTCATAA CAACGATAGA AATAGCTGAT GACTTAATCG TCATGTCTTT 7800

AGAGGCATAA CTCGCATAAG TAATCATACC TGTAGTTCCT AATGATAACG TAAAGAATGA 7860 

TTGACCTAGC GCAAACAAGA TGCCATCAGC AGTAATCTCT GATACTCTTG GTTGTAAAAT 7920 

AAATTTCACA CCTTCTAAGA CGCCATCTAA TGTTAAAGAC TTAATCACAA TGACGATTAA 7980

AAAGACAAAC AGCAATGGCA TCATAACTTT CGATGCCTTT TCTAATCCTT TTTCAACACC 8040 

TAACATGACA ATAATCATCG TAGCGAATAT GAATATACCT TGCCCTAGAA CGGTTAACCA 8100

AGGATTTGAT ATTACCGCTT CAAAATTCAT TTCTTGGAGA TGATTGATGC GTTGAAATAT 8160 

AACTAATTGC CATAATACTT GTCCGATGTA AATGACAATC CAACCACCGA TAACACTATA 8220 

TTTACCAGTT AATTTACTAT ATATTTGTGT TGTATATOTC CGTCCCATTT TCCCAACAGT 8340

GAATTCCATA ATGAGTAATG GCAACCCAAC AAAAATGGTG AATATTAAGA ACATAGCTAG 8400

AAAGGCACCG CCGCCATAAA TCCCTGCCAT ATATGGGAAT TTCCACATGG CACCAAGACC 8460 

GATTGCAGAA CcCGCACTAG CTAAAATAAA TCCAGTTGAT GACTTCCATT GTGATTGTTG 8520 

TCTTTTCATC ATTCAC 8536 
(2) INFORMATION FOR SEQ ID NO: 279: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4328 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279:

GCTTTGGCCA TTTTATGTGG CGATTGAGAC AATCtGTkGT TGTCTTATTT GATGTTGTAT 60 

TTCAACTGGT AATTCTAGTT GCGATTGAAA TAATGGCAAC TTTTCCCAAT CATTAACAAA 120 


TAATTCAATA CCTGCTATGT CTAATACTTT AGCAOGTGCA TCATCAACAA GACGGCGTTC 180 

CAATTGATTT GCTTCTTCTT TAATACCTGG TGACGTACTT TCTAATATCA AATTAGATAT 240 

AGGGATGTGA CCATTAATTG CATAATATAA TGCAACACGC CCACCCATTG AATATCCAAA 300 

CAATGTTATT GATTTATCTT TATATTTATC TAAAATTCGG TCTAACAACG TCGTAATATA 360

ATCAAAATTC CACGTTTCAT CCATTGAAGA CTGATCTTCG CCATGGCCTG GTAAGTCTAT 420

AGTGATGACA TGATAGTTAT CAGTAAATTT TTCGATGTGA TTATGATAAG TACGGCTGTC 480

GCTAAGAAAT CCATGCAGAA ATACTAAAAC TTGATTGGTC TCAACGTTTG CTTCATAAAA 540 

TTTCTAATGT GTCATGAATC ATTTCACTCA ATTTCTGGTA TAAAATTTGA TGCTGTTTAA 600 

AGTTATCTTC GCGATTCGTT ATCAATTCAT AAATCGTCGA AGTTTCAGAT AACAATGTGG 660

CATTTTTAAA TTCTGAAACA CTGTTAAAAC GTTTAAAATC GAATTGATAT AACTTAGCTG 720 

TATACTCGAA ATCCAATCCC GTCGGTGTGC CAAACAACCG TTCAAAATAG TCAGTTGCAC 780 


TTTCTTTTTG TGGTAAATAT GAAAAAATAC CGCCACCATC GTTGTTCAAT AATACAATAT 840 

TCATCTGAAT ATTATTTAAT TTTGACATTA ATAGTCCATT CATATCATGA TAAAATGATA 900 

AATCACCTAT CAATAATGTT ATTCGTTTAT GCACAGCCAT ACCCAGTGCA GTTGAAACGA 960 


TACCATCAAT ACCATTCGCA CCACGATTCG CATAGACATC TATATTTTTA TTCAATAACA 1020 

AGTTATCTAC ATCTCTGATA GGCATACTAT TACTAATAAA TAATGCATCT TTTTCAGATG 1080 

TAATTTCTTT ACGCCCTTTT TTCTCTAAGC ATTGCCATTT TTCTAACCAA CTTACGCGAT 1200

TAACTGTCGT GTCTTCCATT AATGACCTAA AGAAATCATT CGCAGAAATC TCATATGAAA 1260

TATCTGGCGC TATCGGAAAG ACATCAATCT TATCATTGTT TTGCACTAAT ATTTGAAATG 1320

CATCAGTTTT CTTTAACCAT TGATTTAACT TTTTAGAAAT CACTGGTTTC CCAACACGAA 1380

TTACGAAATC CACATTTAAG TCTAAGCCGC TTCTAAACAG CAAATCATAT GTACAGATAA 1440

CATTCGGATG ATCAAATTTT CTTAAATGAC TTAAAGGATC AGCTAAAATA GGCAAATCAT 1500

ATATCGTTGA ATACGTTAGT ATTTGaTCAA CTTCTTGGTG CTGCATATCC CCTACAATAA 1560

TTAAACCTTT TTTCTTATTT AAAATGTGTC TTAATGCCGA TGCATCTATA crrrrrrGAT 1620

AGTGCGGTAA AATCTTCATC TCAGAAGTTA ACAATTCTGT TGCATTCAAA TCAGGTGTTA 1680

ACGGATCTCT AAATGGCAAG TTAAAATGAA TTGGCCCTTT ATGTGGTCCA TATAAATATT 1740

GACTAGCAAT TTGCATTTGA TAGTAAATTG CATCAATGGT CTCTTTACTA TCATCCGCAA 1800

TAGGCATATC GAACTCATAA CTTACATAAT TATTAAACAT ATTTACTTGA TTAATCGCTT 1860

GTGGTGCGCC TACACTTCTT AATTCATGCG GACGGTCACT TGTTAAAACG ATTAAAGGAA 1920

TTCTACTAAT TTGGCTTTCA GCAATTGCAG GCGTATAATT CGCTGCTGCT GTACCTGACG 1980

TACATAATAT AGCGACAGGT CTTTCACTGC CTTTAATTAA CCCAACTGCA AAAAACGCTG 2040

CACTTCGCTC ATCGGGGTGT ATCCATGTTT TAATATTTGG ATGTGCTTCA AATGCAAGTG 2100

CAAGTGGCGT TGAGCGTGAT CCCGGACTGA TAACTACTTC CCTTACGCCG TACGCATATA 2160

ACTCAGATGC AAATGTAAAA ACTTGCTTCG TTAAAGCTGC TTTATGATTT CCCATTCATA 2220

TCGACTCCTA ATGCATTCAT CATAGGTGTG AACTTAAGGT TCGTTTCTGC CAATTCACTA 2280

TCTGGATCAG AATCTTTAAC AATGCCACAC CCAGCAAATA AAGTTGCTTG TGCTTTCTTA 2340

ATAAGCATCG AACGAATTGC AACAATAAAT TCACAATCAT CGTATATATC TATATAGCCA 2400

ACCGGTGCAC CATATAATCC TCGCGTACCA AATTCTTTCT GCTCAATAAA ATCCATTGCA 2460

AATTCTTTTG GATAGCCACC TAAAGCAGGT GTTGGATGTA AATTATCAAT TAAACTAATA 2520

TACGAATCAT CCTTCAGTGG CGCCTTTATT TCAGTGTACA AGTGATATAA ATGATCATTT 2580

TTTAGAATTT TAGGCGTCTT ATCATAATGT AATTCAGTGA TATAAGGTTT AATATCATGT 2640

AAAATACTGT CAACAACAAA TCGATGTTCG ATTAAGTTTT TATTATCTTT TAAAAATGCT 2700

TCAACATTTT TTGTATCTTC GTCCTCATCT TGTGAACGTT TAATTGTACC TGCTACAGCT 2760

TTAGTCGATA GTATTTTATT ATTGACCTTT ATTAATTGTT CAGGTGTTTG TGAAAAGAAT 2820

ATAGAATCTT GTGATTCTAA CAAGAATATA TAACTGTTTT TTTCTTTAGA ATATGCTTGC 2880
ACAATTTTTT CTTCATTATT AATAGATTCT ATAGCTTCTA CTACAAGTTG ACGCCAGTCA 3000

TCTTTATAAA TATCTTCATT TCTAGTAATT TCCCCAATTT GCTCGTCCAC ATCTATGTCC 3060

GATATATTGT TGAACAAATC CATTAAATCG TTCAATGCCT CAACAGTAAA ACTTTCCCTT 3120

TTAACTGTAT AAGTTAAAAA TGTCCCATTA TTATCAGTTG AAATTAAAAC TTCAGGTAAT 3180

ACAAAATGAT TTAGTCCAAA CTCTCGCCAT TCATCATCTG ATTTATOACT TGAAAATTGG 3240

AACCCTCCAA CAACTCGAAG ATGATGTTTC TCAGATTGCG GATGTATAAA TGTGATGTTA 3300

TGTTTTAATT TTTCCCAGTC TTTAAAAATA GATTGTTTAT TTTTAGAATT ATTTTTGAAT 3360

AATTGAATTG CTTTGTAGCC AAAATATGAC GTTCGATTAT CATTCAAACG CATATAAAAG 3420

CGATCTCCTG CCTCATTGTC AGTGAGATGA AATAATGTGC TCGGGTCTAG TGACTGTGAT 3480

AATTTCACTT CAACTGAAAC CCATTCCTTT GAGCTGCCAT ATATCTCTTT GACAATATCG 3540

TCCTCTAATA CGCCCGTAGC CATCCATTTC ACTTCTTTCT TCGTCTTTTT TCACTCATTA 3600

TTATATTGTA TCATTTTTGG ATAATTGTGT TACAAGAATT GCTTAAACTT ATCTTGCAAT 3660

TTTTCACGTC AATTGACCTT TATGCTACTT TCTATTAAAA TATCTTTGTT ATAAAAAATA 3720

TGATTTAAAG AGGTTTTGTA TTCAATGAGT AATCAATATC AGCAATATTC TACAGTTAAG 3780

AAATATTGGC ATTTAATGOG TCCTCATACA TTAACTGCTT CCGTAGTACC CGTTTTAGTT 3840

GGTACAGCAG CATCTAAAAT ATATTTTCTT GGTAGCGAAG ATCATATTAA AATCAGCCTA 3900

TTCATTGCCA TGTTACTAGC ATGCTTACTT ATTCAAGCAG CAACTAATAT GTTTAATGAA 3960

TACTATGATT ATAAAAAAGG CCTCGATGAT CATGAATCTG TAGGCATTGG TGGTGCCATT 4020

GTTCGCAACG GTATGAGCCC AGAGCTTGTG CTACGATTAG CCATTGCATT TTACATCTTA 4080

GCAGCAATAT TAGGTTTGTT TTTAGCTGCT AACTCTTCAT TTTGGTTATT ACCAGTTGGA 4140

ttagTatgta TGGCTGTTGG TTACCTATAT ACAGGTGGCC CTTTCCCTAT TTCATGGACG 4200

CCTTTCGGTG AATTATTCTC AGGCGTATTT ATGGGTATGT TTATTATCGT TATTGCATTC 4260

TTTATTCAAA CTGGCAATAT TCAAAGTTAT GTAATTTGGT TAAGTGTACC TATAGTAATC 4320

ACTATCGG 4328
(2) INFORMATION FOR SEQ ID NO: 280:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1450 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
Q* I rCAATACA GAAAAAATAA ATTTAGATGT TGAAGCATCC TACAATTAAT ACAGATCCAT 60

TTCAATATAT TTAAACTAAA ATCTCGGGAT TTCTAAATTT TQAAATTTCG AGGTTTTnAT 120

ATTTTTATTT AAAATAGCAC ATTTATACTT TATAATAGTA AAGATGAACA TATAAGGAGG 180

CCAAATCATG GCAAAACATC GATTCGAAGA ATTTAATCTA GAATCTAOTT TAATTGACGC 240

TGTGAAAGAC CTTAATTTTG AAAAACCAAC TGAAATTCAG AATCGAATTA TTCCAAGAAT 300

ACTAAAGAGA ACAAATTTAA TTGGTCAATC TCAAACGGGT ACAGGGAAAT CTCATGCATT 360

TTTATTACCA TTAATGCAGT TAATTGATAG TGAAATAAAA GAACCACAAG CAATCGTAGT 420

TGCACCAACA AGAGAACTTQ CACAACAACT ATACGATGCA GCGAACCATT TAAGCCAATT 480

TAAAGCTGGT GTTTCAGTTA AAGITITIAT TGGTGGTACA GATATAGAGA AAGATAGACA 540

ACGTTGTAAT GCACAACCAC AATTGATTAT AGGCACCCCT ACTAGAATTA ATGACTTAGC 600

TAAAACGGGA CATTTACATG TGCACTTAGC ATCATATTTA GTTATTGATG AAGCGGATCT 660

TATGATTGAC TTAGGATTAA TTGAAGATGT AGATTACATT GCTGGAAGAT TGGAAGATAA 720

TGCAAATATT GCGGTGTTTA GTGCTACAAT CCCACAACAG TTACAACCAT TTTTAAATAA 780

ATATTTAAGT CATCCAGAAT ATGTAGCTGT CGACAGTAAA AAACAAAATA AAAAGAACAT 840

CGAATTCTAT TTAATACCTA CTAAAGGTGC AGCTAAAGTT GAAAAGACTT TAAATTTAAT 900

TGATATACTA AATCCATACT TATGTATTAT TTTCTGTAAT AGTAGAGATA ATGCAAATGA 960

TTTAGCACGT TCACTAAATG AAGCTGGTAT TAAAGTTGGT ATGATTCATG GTGGCTTAAC 1020

GCCaCGTgAA CGTAAACAAC AAATGarACG TATACGTAAT TTaGAATTCC aATACGTTAT 1080

TGCCaGCGAT TTAGCATCTC GTGGTATTGA TATTGAAGGT GTTAGTCrTG TCATCaATTT 1140

TGATGTGCCA AATGATATTG ACTTCTTTAC GCATAGAGTC GGACGAACTG GTCGTGGGAA 1200

TTATTrAGGT GTAGCAATTA CGCTTTATAG TCCTGATGAA GAACACAATA TTTCATTAAT 1260

AGAAGATCGC GGTTTTGTAT TCAATACTGT TGATATTAAA GATGGTGAGT TAAAAGAAGT 1320

TAAAGCGCAC AATCAGCGTC AAGCAAGAAT GCGCAAAGAT GACCATTTAA CTAATCAAGT 1380

GAAGAACAAA GTTCGAAGTA AAATTAAAAA CAAAGTTAAA CCAGGTTATA AGAAGAAATT 1440

TAAACAAGAA 1450
(2) INFORMATION FOR SEQ ID NO: 281: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1139 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281:

AGTCAGGTAT ATCATGCCaT yCTOAATTGG TCGATATTAA TATCAGTGGT GTTAAAGAAC 60 

GAATTGTATA CCAATAGACG CTTTATATTG TAAAATAGTA TTAAATGCaO AATAGAGAGG 120 

AGATTTAATG CGATATGACA AATTATAAAG TTGTCGTTTT AGATATGGAT GACACATTGC 180 

TAAATTCAGA TAATGTGATA TCAGAAGAAA CTGCAAATTA TTTAACAGCA ATTCAAGATG 240 

AAGGTTATTA TGTTGTTCTA GCATCTGGTA GACCTACTGA AGGTATGATT CCAACTGCTA 300 

GAGATTTAAA ATTACCTGAA CATCATAGCT ATATTATTAG TTATAACGGT AGTAAAACGA 360 

TTAACATGAC TAATGAAGAA GTAGAAGTAA GTAAATCGAT TGGTAAGCAA GATTTCGATG 420 

AAATTGTAGA TTATTGTCGA GATAGAGGCT TTTTCGTTCT TACATATCAT GATGGTCAAA 480

TTATTTaCGA CAGCGAACAT GAGTATATGA ATATTGAAGC AGAATTAACA GGTTTACCGA 540 

TGAAACGTGT TGATGATATC AAAGCGTATA TTCAAGGCOA TGTACCCAAG GTCATGGGTG 600

TAGATTATGT AGCGAATATT ACAGAAGCTA GAATTGATTT GAATGGTGTG TTCAATGATA 660 

ATGTAGATGC TACGACAAGT AAGCCATTCT TCTTAGAATT TATGGCCAAA GACGTTTCAA 720 

AAGGTAATGC AATTAAAGCG TTATGTCACA AATTGGGATA TTCGGTGGAT CAAGTCATTG 780

CTTTTGGTGA TAGTATGAAT GATAAATCAA TGTTTGAAGT CGCAGGTCTA GCTATTGCTA 840

TGGGGaATGC ATCAGATGAA CTTAAGCAAT ATGCAAATGA AGTTACGTTG GATCATAATG 900 

AAAATGGTAT TCCACATGCG CTCAAAAAAT TGTTATAAAT TTTAAAATAA GCCTTAACAC 960 

ATGATATTTG AATAAGATAT CTTGTGGTTA AGGCTTTTTA TTTTTGTGAA AATGACTTCA 1020 

GTTATACTAT GGAGGATTTG AAATACATAT TTTAGATTAG TAATGATATC AAACGAATAG 1080 

AGTAAATGTA TATTTttTGA ATAAATCAAG TATTAACTAG TCACGGAAGG nAGATAAAT 1139 
(2) INFORMATION FOR SEQ ID NO: 282: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2931 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282: 

TCTAAAAATG CTGTGAAATT CTTTTATAAA TATCTAAAAG GAATTAATGT TGATAACATT 60 

GCTGTGATAG GAAGTAAGAC AGCGCAATAT TGTGAATCAC TTGGCATTCG AGTTGATTTT 120 

AtGCCAAACG ACTTTTCTCA AGAAGGATTT TTAAAATCAT TTAATCAAAC TAACCAAAAA 180 
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AATQAAGTTG TTAAAATAGA TTTATATACT TCAGTGCCTA ACAAACAAAA TATAGAAGAT 300 

GTTAAAGAAA TGATAGAACA TCAACAAATC GATGCATTAA CATTTTCAAG TTCGTCGGCA 360 

GTACGTTATT ATTTTAATGA AGGATTTGTA CCAAAATTCA AGTCGTATTT TGCTATTGGA 420 

GAACAAACAG CACGGACCAT TAAATCATAT [illegible] AGAAATTCAA 480 

ACACTCOAAT CACTAATTGA AAAGATTTTA [illegible] AAATTTGATA 540 

GACATAGAAG ATTGAGATCA TCAGCGACAA [illegible] AATCATGTAA 600 

GAAAAGAAGA TTTAATATAT CCAATTTTTG TAGTTGAAAA AGACGATGTG AAAAAAGAAA 660 

TTAAGTCATT GCCAGGTGTA TACCAAATCA GTTTGAATTT ACTTGAAAGT GAATTAAAAG 720 

AAGCTTATGA CTTAGGCATA CGTGCCATTA TGTTTTTCGG TGTTCCAAAC TGAAAaGATG 780 

ATATAGGTAC TGGTGCATAC ATTCACGATG GTGTTATTCA ACAGGCAACA CGTATTGCTA 840 

AAAAAATGTA TGATGACTTA TTAATTGTTG CAGACACTTG TTTATGTGAA TATACTGATC 900 

ATGGTCATTG TGGCGTGATT GATGACCATA CACATGACGT TGACAATGAT AAATCATTGC 960 

CACTACTTGT TAAAACAGCA ATTTCTCAAG TGGAAGCTGG TGCTGATATT ATTGCGCCAA 1020 

GTAATATGAT GGATGGTTTT GTTGCTGAAA TTCGTCGTGG ATTAGATGAA GCCGGCTATT 1080 

ACAATATTCC TATAATGAGT TATGGTGTCA AGTATGCATC AAGTTTCTTT GGACCTTTTA 1140 

GAGATGCAGC AGATTCAGCG CCATCATTTG GGGATAGAAA AACGTATCAG ATGGACCCTG 1200 

CTAACCGTTT GGAAGCACTT CGTGAATTAG AAAGTGATCT TAAAGAAGGG TGCGACATGA 1260 

TGATTGTTAA ACCTGCTCTA AGTTATTTAG ATATAGTTCG AGATGTTAAA AATCATACGA 1320 

ATGTTCCAGT TGTTGCATAT AATGTGAGTG GAGAATATAG TATGACTAAA GCAGCGGCAC 1380 

AAAATGGTTG GATAGATGAA GAACGTGTCG TTATGGAACA AATGGTTTCA ATGAAACGTG 1440 

CAGGTGCTGA TATGATTATT ACGTATTTTG CAAAGGACAT TTGTCGCTAT TTAGATAAAT 1500 

AAGGTTTTAT ATTTATGATT TTCCATAAAC TGTAGGAGGA ATTTACTTTA TGAGATATAC 1560 

GAAATCAGAA GAAGCAATGa AGGTTGCTGA AACTTTAATG CCTGGTGGTG TAAATAGTCC 1620 

AGTACGCGCA TTTAAATCAG TAGATACACC AGCAATTTTT ATGGATCACG GTAAAGGTTC 1680 

AAAAATTTAT GATATCGATG GTAACGAGTA TATCGACTAT GTACTAAGTT GGGGACCACT 1740 

TATTTTAGGA CATAGAGACC CTCAAGTTAT TAGTCATTTA CATGAAGCAA TTGATAAAGG 1800 

TACAAGTTTT GGTGCATCAA CATTACTTGA AAATAAATTG GCGCAgcTCG TTATTGACCG 1860 

AGTACCTTCA ATAGAAAAAG TGCGTATGGT GTCATCTGGT ACAGAAGCTA CATTGGATAC 1920 

TTTAAGATTA GCACGTGGTT ATACTGGCAG AAATAAAATT GTGAAATTTG AAGGTTGCTA 1980 

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GCCGGATTCT CCTGGTGTGC CTGAAGGTAT TOCTAAAAAT ACAATTACAG TTCCATACAA 2100 
TGATTTAGAT GCACTTAAAA TCGCTTTCGA AAAATTTGGa AACGATATTG CTGGTGTAAT 2160 

CGTAGAACCT GTTGCTGGTA ATATGGGTGT CGTACCGCCG ATTGAAGGTT TTTTACAGGG 2220 
ATTAAGAGAT ATTACGACTG AATACGGCX3C ATTGCTAATT TTCGATGAAG TAATGACTGG 2280 
TTTCAGAGTC GGTTATCATT GTGCACAAGG TTACTTTGGT GTGACACCAG ATTTAACTTG 2340 

CTTAGGAAAA GTTATCGGTG GAGGACTACC TGTAGGTGCA TTTGGTGGTA AAAAAGAAAT 2400 
CATGGATCAT ATAGCACCAT TAGGAAATAT TTATCAAGOG GGTACGTTAT CAGGAAATCC 2460 

TCTTGCAATG ACAAGTGGTT ATGAAAOGTT AAGCCAATTA ACGCCAGAGA CATATGAGTA 2520 

TTTTAATATG TTAGGCGATA TACTTGAAGA CGGTTTAAAA CGTGTATTTG CTAAACACAA 2580 
TGTACCAATA ACTGTAAATA GAGCAGGTTC AATGATTGGT TATTTCTTAA ATGAAGGACC 2640 

TGTAACTAAT TTTGAACAAG CGAATAAAAG TGATTTGAAA TTATTTGCAG AAATGTATCG 2700 

AGAAATGGCA AAAGAAGGTG TGTTTTTACC ACCATCTCAA TTTGAAGGTA CATTCTTATC 2760 
TACGGCACAC ACGAAAGAAG ATATTGAAAA AACGATTCAA GCATTTGATA CGGCTTTAAG 2820 

TCGTATTGTA AAATAAATAT ACGGACAAAT TGAGAGCCTG AACTTTGTTC AGGCTCnTTT 2880 
TAAATGTATA TAAGGCATGG GCGGCGACTT GATAGTGAAA GTCCACTACT A 2931 

(2) INFORMATION FOR SEQ ID NO: 283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283: 

AATTATGAAT GCATTACCAG TATTATTACA AAAGAACAAT TAAAAATGTT TGTTTATGAT 60 

TATGATACGC ATCTCATTAA AAATGTAaTG GTTGCAGCAG ACGTGTTAAA GGCAAATGAT 120 

ATTCAAGGAC ATGAACCATT AATCGTTAAC CTTCAAACGA TTGATGAAAC ATTACATCGT 180 

TTACCTATGC ATAATAGAAA AGACATGATG GTTAATGGCG GTGTACTTAT GGCACATTTA 240 

AATGCCAAAA GTGGTCCGTG GTTAAAAGAT GTGCTAAGAC AAATTGAGAT AGCGATTGTA 300 

ACAGGTAAAG TAAGCAACGA AGAAACTGAA ATTTTGAAAT GGGTGGATAA TCATGTCAAA 360 

ATATAGTCAA GATGTACTTC AATTACTCTA TAAAAATAAA CCGAATTATA TATCTGGACA 420 

AAGCATTGCG GAATCACTTA ATATTTCACG CACTGCAGTA AAAAAAGTGA TTGACCAATT 480 

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CCCAGATATT TGGTATCAAG GTATAATAGA CCAATATACA AAAAGTTCTG CTTTOTTTOA 600 

TTTTAGTGAA GTATACGATT CAATAGATTC TACACAACTT GCTGCGAAAA AGTCACTTGT 660 

TGGAAATCAA TCTTCATTTT TTATCTTGAG TGATGAACAA ACGAAAGGTC GTGGGCGATT 720 

TAATAGACAT TGGAGTTCTT CAAAAGGGCA AGGACTTTGG ATGTCTGTCG TGTTAAGACC 780 

TAACGTTGCA TTCTCAATGA TATCTAAATT TAATTTATTT ATTGCATTAG GGATAAGAGA 840 

TGCGATTCAA CATTTTAGTC AAGATGAAGT CAAAGTGAAA TGGCCGAATG ATATATTTAT 900 

TGATAATGGT AAAGTGTGTG GTTTCTTAAC TGAAATGGTT GCTAATAATG ATGGTATAGA 960 

AGCAATAATA TGTGGTATAG GTATTAATTT GACGCAACAA CTAGAAAACT TTGATGAAAG 1020 

TATTAGACAT AGAGCAACAA GTATACAATT ACATGATAAA AATAAATTAG ATAGATATCA 1080 

ATTTTTAGAG ATATTACTTC AAGAAATTGA AAAAAGATAT AATCAATTTT TAACGTTACC 1140 

TTTTTCTGAA ATTCGTGAAG AATATACTGC AGCTTCTAAT ATTTGGAATA GAACGTTGCt 1200 

ATTTACAGAA AATGATAAAC AGTTTAAAgG ACAAGCAATT GATTTAGATT ACGATGGCTA 1260 

TCTAATTGTT AGAGATGAAG CGGGTGAATC ACACCGTTTA ATTAGTGCAG ATATAGATTT 1320 

TTAACACTAA AGCAAGGAGA GATAGCTATG GGTATGGCAA CCTATGCCGT TGTGGATTTG 1380 

GnAACAACAG GCAACCAATT AGATTTTGAC GATATCATTC A 1421 
(2) INFORMATION FOR SEQ ID NO: 284: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2202 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284: 

CCAAGTTGCC TAAAATGATT AAGCAAGGTT TATACCCTAT GCnAACGATT GAACAAGAAT 60 

CTGGAGCCAT CCGACTGCCA ACGATTTCTA GAGTGAnGCG TTCATTACAA TGGGGTAATG 120 

ATGCTTATAC AATGATTTTA GATCGTATGA ATATTGAAAC AAATGAATAA TAAATGAACG 180 

ATAAACAATG GTTATCTATC TGCACTAATA AGGTAGATAA TCATTGTTTT TTCACGAAAA 240 

AATTTACAGA GTAAAAGAAC TTAAATTTCA TATTAAGTCT TTAGAACTCG ACACTTAAAA 300 

ATGCTATAAT CATATGTATG TTAAAAAAGG AGTTTCGGAA AATGTATGAC ATTAAAAAAT 360 

GGCGCCATAT TTTTAAATTA GACCCAGCTA AACATATTTC AGATGATGAT TTAGATGCGA 420 

TTTGTATGTC TCAAACAGAT GCAATTATGA TTGGTGGAaC TGATGACGTT ACTGAAGATA 480 

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CAAACATCGA AAGTGTAATG CCTGOTTTTG ATTtTTATTT TGTACCTACA GTACTGAACA 600 

GTACAGATGT TGTATTTCAC AATGGTACAT TATTAGAAGC GCTTAAAACA TATGGACATA 660 

GTATAGATTT TGAGGAAGTA ATATTTGAAG GGTATGTCGT GTGCAATGCT GATAGCAAAG 720 

TGGCAAAACA TACCAAAGCA AATACAGATT TAACAACAGA AGATTTAGAA GCATATGCCC 780 

AAATGGTCAA TCATATGTAT CGATTACCGG TTATGTATAT AGAGTATAGT GGCATTTATG 840 

GCGACGTATC AAAGGTTCAA GCTGTCTCAG AACATCTAAC AGAAACGCAA CTTTTTTATG 900 

GTGGCGGTAT TTCCTCAGAA CAACAAGCGA CAGAGATGGC AGCTATTGCA GATACAATTA 960 

TCGTCGGTGA TATTATTTAT AAAGATATTA AAAAAGCTTT AAAAACAGTA AAAATAAAGG 1020 

AGTCTAGTAA ATGAATGCGT TATTAAATCA TATGAATACA GAGCAAAGTG AAGCTGTAAA 1080 

GACAACAGAA GGACCATTGT TAATTATGGC AGGTGCTGGT TCAGGGAAGA CACGTGTTTT 1140 

AACACATAGA ATTGCTTATT TATTAGACGA AAAAGATGTC TCACCATACA ATGTTTTGGC 1200 

TATTACTTTT ACAAATAAAG CTGCAAGAGA AATGAAAGAA CGTGTTCAAA AATTAGTAGG 1260 

TGATCAAGCA GAAGTTATTT GGATGTCAAC ATTCCACTCA ATGTGTGTTC GTATTTTACG 1320 

TCGTGATGCA GATCGAATTG GTATAGAACG CAATTTTACG ATAATTGATC CTACAGACCA 1380 

AAAATCTGTT ATTAAAGACG TCTTAAAAAA TGAAAATATT GATAGTAAAA AGTTTGAACC 1440 

TCGTATGTTT ATCGGTGCGA TCAGTAATTT GAAAAATGAA CTTAAAACAC CTGCAGATGC 1500 

TCAAAAAGAA GCGACAGATT aTCACTCgcA AwTGGTaGCA ACgGTTTaTA GTgGATATCA 1560 

ACGCCAATTG TCACGTAATG AAGCGTTAGA TTTTGATGAC CTTATTATGA CAACGATTAA 1620 

CTTATTTGAG CGTGTACCAG AAGTTCTAGA ATATTATCAG AACAAATTCC AATATATTCA 1680 

TGTAGATGAG TATCAAGATA CTAATAAAGC ACAATACACA TTAGTTAAAT TATTAGCAAG 1740 

TAACTTTAAA AACTTATGTG TTGTAGGTGA CTCAGATCAG TCAATTTATG GTTGGCGTGG 1800 

TGCTGATATT CAAAATATCT TATCATTTGA AAAAGACTAT CCAGAAGCGA ATACAATCTT 1860 

TTTAGAGCAA AATTATCGTT CGACGAAAAC GATTTTAAAT GCGGCTAACG AAGTGrTTAA 1920 

AAATAATTCT GAACGTAAGC CAAAAGGACT GTGGACTGCA AATACGAATG GTGAGAAAAT 1980 

TCATTACTAT GAAGCAATGA CGGAACtGAT GAAGCGGAAT TTGTAATACG AGAAATTATG 2040 

AAGCATCAAC GTAATGGTAA GAAATATCAA GATATGGCAA TTTTATATAG AACGAATGCA 2100 

CAATCACGTG TACTTGAGGA AACATTCATG AAATCTAATA TGCCATACAC AATGGTTGGT 2160 

GGCCAAAAGT TCTATGACCG TAAAGnAATC CAAAGATTTA TT 2202 
(2) INFORMATION FOR SEQ ID NO: 285: 

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(A) LENGTH: 785 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 

AGTGGTGCAA AGATAGGCAT TGATaATACC GCTAAGCCAG CAAGATGATG GCACGATAAA 60 

ACCTAAACAG AAGAAmATAA ATAGTAATAC GATGATAAAT AATGGTCCAC TCATATGTTG 120 

AACTAAAGAT GATGAAAAGT GTAAGATTGT ATCTGAAATC ATACCTTCAT TCAACACTAA 180 

ATTAATACCT CGAGCTAAAC CAATAATTAA AGATACACCT ACTAAACTTG ATGCACCATT 240 

GACAAATGCA TCTACAGTTC CTTTTTCTCC CAATCCAGAT TTACCTGTCC CAGCAATAAA 300 

CATTATTATA ATTGTAAATA TTAAAAATGC TGAAGCCATA ACTGGGAACC ACCAACCTTG 360 

CGTCATAACT CCCCATACCA TAATTGGAAA TGGTAGTACA AATAATGTAA GGATTATCTT 420 

CTTACGCAAA GTAAAATGGG CACTATCGTC ATCTTTTAAT ACAGACCATT GCTGTTCAAA 480 

AGCATCTTTG TCTTCATAAG AATATGACGC TTTAGGATCG TTTTTAATTT TTTTACAGTA 540 

CCAATATAAA TAACTAATAA CAAAAATCGC ACCGACAATA CAAGCACCTA TTCTCCAATA 600 

CAAGCCATCC GTAAAAGTTG TACCAGCGGC ATTAGAGGCA ATTACAACCG AGAACGGGTT 660 

AATAGTTGAA AATGTACTAC CGACAGAGCT GGCAAGGAAT ATGGCACCAA CTGGAAACGA 720 

TAGAATCGTA TCCTAACGCT AATAAATATA GGGACTAAAA TCGGATAAAA TGCTACAGCT 780 

TCTTC 785 

(2) INFORMATION FOR SEQ ID NO: 286: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 812 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 

CTAACGnGAT AAGGTTGCAA nTTTATCTGA ACATCTGATG ACTGTAATTT TGTtAATGAT 60 

AAAATATTTG TCACTAATAG ATATAAATAC TGACTTTCTT GAAAACTATG TACAAGTAAT 120 

TGTTCCTTTT CTATGATAGA CATATCTTTA CTATGTGATA CTAAAATATC TAAATkTCCC 180 

ATAATTGTTG TTAACGGTGT ACGTATGTCA TGCGAAATTG ATCTTAAAAA ATTTGAATGT 240 

GTCAGTTGAC GTTCAGCCTG TAACATGGAT TCTCTCGTTT GTTTAAGTAA CGTCACATTT 300 
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ATCACTTGAG AACTTTGGTA ATCAATGGCT AGAATGCCTT TAATCGGAGA TGTGCCAATT 420 

GGTATCAACC ATTTATTAAT GCCTGGAAAT GTATCTGTTG TTGCACCAGC TTGTCTTTCA 480 

TTTTTAATTA CCCAGCTTAA TGCTTGTTCA TGCTGTTGAG TCGTATTATC GATATGGTTT 540 

TGCAATGGTA TTGTTTTAAT TACTTTCGAT TGATTGATAA CGTATATAGT AATTGATTGT 600 

TGCAATAATT GATTAATTTG GTATCCAGCA TTTATTAGTA AGTTTTCAAC TGTATAAGTT 660 

TGTTTAATCG AATCATTAAA TTGAAATAAT AAATCTGTAC GATAAAGTTG CTTTTTAGTA 720 

ATGGaGTAwT GGAATTTAAT TTGTnTTAAT AAAGCACTCG TTAAAATACT TGTnAAAATG 780 

CTAACGATAA ATGTAATAGG ATAGTCAAAO CG 812 
(2) INFORMATION FOR SEQ ID NO: 287: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1732 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: 

ATnnATTATT ATTACTGCTA TTTTAAATTT TAAAAAATGC TTTTGATTAT ATTCAACAnT 60 

TTGTATAAAA TTAAATTTGC TTTTGATTAA AGCATGAAAA TTGTAATCAA ACCATAAATT 120 

GTCGTATGAT GTAGTTAGAA TTTTAAAATG CAGGAGGTCa AGTATATGAC TGAAATaACA 180 

TTCAAAGGTG GACCAATCCA CTTAAAAGGT CAACAAATTA ATGAAGGTGA TTTTGCACCT 240 

GATTTTACAG TGTTAGATAA TGACTTAAAT CAAGTAACAT TAGCAGATTA TGCTGGTAAA 300 

AAGAAATTAA TTAgTGTGGT ACCATCAATT GATACAGGTG TTTGTGATCA GCAGACTCGC 360 

AAATTCAACT CTGATGCTTC TAAAGAAGAG GGGATTGTGC TTACAATTTC AGCAGACTTA 420 

CCATTCGCAC AAAAAAGATG GTGCGCTTCA GCAGGTTTAG ACAATGTCAT TACATTAAGT 480 

GACCACCGTG ACTTATCATT TGGTGAAAAC TATGGCGTTG TTATGGAAGA ACTTCGCTTA 540 

TTAGCTCGTG CAGTATTTGT ATTAGATGCA GATAATAAAG TTGTTTATAA AGAAATCGTT 600 

AGTGAAGGTA CTGATTTCCC AGATTTTGAT GCTGCTTTAG CTGCATACAA AAATATTTAA 660 

TCATTAAAGA GATAAATCTT AAAATGTATA CATCGTGTCC ATCGTTGTCA ACAGCATTAA 720 

AATAGAATTG TTTTCTATGA TTGCTAAGAC CTATGGGCAC TTTTTATTGG AGAGGGACGA 780 

ATATGGCAGA ACAACAAACA ATTATGGAAC GCTTGTTTCA TACATTAGAT GAAAAAGCTA 840 

AAACATTAAA TAATGAAAAT GGCCaAAGTT TTATTGAAAA TCTTGGGCTA GCAATGGAAC 900 
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CATTCCAATT TGCATATTTA AGTTTAATGC aGGAAGAAAA GATAcAAGCA AATCATCAAA 1020 

TTACACCAGA TTCAATTGGA TTGATACTAG GATTTTTAQT TGAGCGTTTT ATGAACAACC 1080 

AAGAAGAATT ACATATTGTT GATATTGCAA GTGGTGCCGG TCATTTAAGT GCTACTGTAA 1140 

AAGAAGTGTT ACCTGraAtT GcGGTTATGc ATcATTTaAT TGaAGTTGAt CCAGTTTTAT 1200 

CACGTGTTAG TGTACATTTA GCAAACTTCT TAGAAATTCC TTTCGATGTG TATCCTCAAG 1260 

ATGCCATCAT GCCACTACCA TTAGAAGAAG CAGATATOGT TATTGGTGAT TTTCCAGTAG 1320 

GCTATTATCC AATTGATGAA AGAAGTAAGG AGTTTAAGCT AGGTTTTGAA GAAGGACATA 1380 

GTTATTCACA TTATTTATTA ATAGAACAAG CAATAAATGC ATTAAAAGAT GCTGGATATG 1440 

CCTTTCTAGT GGTACCAAGT AATATTTTTA CAGGTGAACA TGTAAAACAG CTTGAAAAAT 1500 

ATATTGCAAC AGAGACAGAG ATGCAAGCAT TTTTAAATTT ACCACCAACT TTATTTAAAA 1560 

ATGAAAAAGC GCGAAAATCT ATATTAATTT TACAAAAGAA AAAATCGGGT GaAACAAAGC 1620 

CAGTTGAAGT ATTATTGGCA AATATTCCTg ATTTCcAAAA TTCCTTCACC AATTTCCAAG 1680 

GATTTATGGA CAGAGTTAAA ATCCAGTGGG ATGGGACCAC CAAATCGTCC TA 1732 
(2) INFORMATION FOR SEQ ID NO: 288: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2779 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 

AAAAGAACTA GCTAAACGCA AGCAAGAAGC TATTAGTAGA ATTAAAGACT TTTCAAATGA 60 

AAAAATAAAT AGTATTCGAA ATAGTGAAAT TGGCACAGCT GATGAAAAAC AAGCAGCAAT 120 

GAATCAAATT AAOGAAATTG TGCTTGAAAC AATTAGAGAT ATTAATAATG CGCATACATT 180 

ACAGCAAGTT GAGGCTGCAT TGAACAATGG TATTGCTCGA ATTTCAGCAG TACAAATTGT 240 

AACATCTGAT CGTGCTAAAC AATCGTCAAG TACTGGAAAT GAATCTAATA GCCATTTAAC 300 

AATTGGTTAT GGAACTGCAA ATCATCCATT TAACAGTTCG ACTATTGGAC ATAAAAAGAA 360 

ACTTGATGAA GATGATGACA TTGATCCACT TCATATGCGT CACTTTAGTA ATAATTTCGG 420 

TAATGTTATT AAAAACGCTA TTGGTGTGGT GGGTATCTCT GGTTTACTAG CTAGTTTCTG 480 

GTTCTTCATT GCCAAACGTC GTCGTAAAGA AGATGAAGAG GAAGAATTAG AAATAAOAGA 540 

TAATAATAAA GATTCAATAA AAGAGACTTT AGACGATACA AAACATTTAC CACTTTTATT 600 
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AAATAATGGC GAGTCACTCG ATAAAGTTAA ACATACGCCG TTCTTCTTAC CAAAACGTCG 720 

TCGTAAAGAA GATGAAGAAG ATGTGGAAGT TACAAATGAA AACACAGATG AAAAAGTGTT 780 

GAAAGATAAC GAACATTCAC CACTCTTATT CGCAAAACGA CGCAAAGATA AAGAGGAAGA 840 

TGTTGAAACA ACAACTAGTA TTGAATCTAA AGATGAGGAC GTTCCTTTAT TATTGGCTAA 900 

AAAGAAAAAT CAAAAAGATA ACCAATCCAA AGACAAAAAG TCAGCATCAA AAAATACTTC 960 

TAAAAAGGTA GCAGCTAAAA AGAAGAAAAA GAAAGCTAAG AAAAATAAAA AATAATTTGT 1020 

TTCTTTGATA AATAGaGGAG CACCGATTGA CATCACATCA GTCGGTGCTC CTTTTATTTA 1080 

TTCTTTTTAA TTAATTTATA CAATGCCTGT TGAGCGTGTT GATTCGCTTC TTTGTTTTGT 1140 

TCTCTCGGTA TCCATTTAAC AAATAATAAA TCAAAATCTT TTTCAAATAT TTCTATTTGA 1200 

TCAAAATAAG GTTTGAAATT TGCGTTTTTC ACATAACCAG CTTCAATGCT ATCTGCAATT 1260 

AGCTTTGAGT CTGTATATAA TAGTGCGTTT TGAACATTTA ATTCACGTGC ATGTTCTAGT 1320 

GCATAAATAC ATGCAGCCCA TTCTGCAGTG TGGTTATCCA TTTCGCCTAA CTCATGTGTA 1380 

TATGTATAAT GCTGCTCATC TTCTTTGATT ACAATGGCAC ATGTACTTAT GCCTGGATTT 1440 

cCTTTCGTCG CAGCATCAAA ATTTATTTTC GCCATAATAA ACCTACTTTC TATTCAATAC 1500 

TTAGTTAAAG TTACTATTAC TGTAATACAA AATATGTTGG GTAATCCATT AAAAAACACG 1560 

CATCACTTAA ATAAGTAACA CGTGTTTAAA ATACTOGCTG ATTCAAAGAT GATTTTCTAA 1620 

TACGTAtACT GTaATATACT TCCTAAAAAA ATCATCTTCA GGCTGGGACA TAAATCAATG 1680 

TTCTATGCTC TACGATGTTA TATTGGCAGT AGTTGACTGA ACGAAAATGC GCTTGTAACA 1740 

AGCTTTTTTC AATTCTAGTC AGGGGCCCCA ACACAGAGAA TTTCGAAAAG AAATTCTACA 1800 

GGCAATGCGA GTTGGGGTGT GGGTCCCAAC ACAGAAGATG ACGAAAAGTC AGCTTACAAT 1860 

AATGTGCaAG TTTGGGATGG GCCCCAACAA AGAGAAATTG GATTCCCAAT TTCTACAGAC 1920 

AATGCAAGTT GGGGTGGGAC GACGAAATAA ATTTTGCGAA AATATTATTT CTGTCCCACT 1980 

CCCTTAAAAC TTATTCTTTT GTGTAGTAAG TGCGTTAATA GCCTTGATCT AACTTATCAA 2040 

TCTTACCTTT ACGATAAAAT GATTTAGCAA TATATCCAAA TGGTACATTG AAAACTGTTG 2100 

AAGCTAATTT TAATACGTAC GTTGTAATAA ATATTTCAAa TACAAmTGTa CCAGGTAAAC 2160 

TTCCGATAAA TGCGATAGCT ACAAATAAAG CTGTATCAAT TATTGrGCTT AAAAATGTAC 2220 

TACCATATGg CACGGATGAA AAACGTTTTa TCmGaACTAA ATACTTTTTT AATTAGTGAA 2280 

AAGATAAATA CATCAATATG TTGACCAATA ATATATGCGA CGATTGAGCC TAAAGCAATG 2340 

CGTGGCACAA CATCAAAGAT TGCGTGTAAT GCTTTTTGTG CCATATCTTC TGGTGCAGGA 2400 

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CAAACTGCTC TTTTTGCAAC TCTACGCCCA TAAATATCGT TTAATATATC TGTTGCTAAA 2520 

TAAATAGAAG CAAACATGAC ATTACCTAAA GTTGCTGAAA TACCAAAGAT TTCTACAQTT 2580 

TTAATCACTT GTATGTTGGC AATGATTGTG CCAATTGCAA CCCATGCAAT TAAACCTTGT 2640 

TTACCAAAAA AGCGATACAT AAGTACCATA AGCACGAACG TTGCAATAAA OGTAACTAGT 2700 

CCTAAAATTT CATTATACAT ATTAAAATGT CCTCCTAAAT TTTGATCATG CGGGTGTTTA 2760 

GAAACCGCTC AATAAATAA 2779 

(2) INFORMATION FOR SEQ ID NO: 289: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289: 

ACTGATGTGC GTTCATCAAA AACAATATAA TCAAATTCAT TTTCATCAAA TTGCTTAAAA 60 

TTATCATCTC TAGATAATGT TTGAATGGTT GCAAATAAAT ATTTGGCATC GACATCTCTA 120 

TGTTTTCCTG TCAACAATCC AAAATCACTA TCATTTTTTA TAGGTAATAC ttttttaaat 180 

TCTTCCTTAG CTCTATTTAA AATCCCCTCA TTATGAACAA TAAATAAAAA TTTATTAGGG 240 

TTTACTTCTC TAACATCTAA TGCACATAAA ATCGTTTTAC CTGTACCAGT TGCAGATATT 300 

ATTAACGCCT TATCTTTGGC TTTATCCCTA ATAGCTTTTA ATGACCTTAA TGCTTCTGCT 360 

TGCATTAAAT TGGGTACAAT TTCCACTGAT TTTTTCACCT TATCAGCTAG CAGCATTTGA 420 

GTTTGTTCAA CCTCCGCTAA TTTTTCTAAG GAACGGTACT CAAATGATTC TTTATATGAA 480 

TTAATCCATT GCTCAGTCAG TGGGGTACTC TTTTGCCATA ACAAGTCAAA TTCACTTTTT 540 

ACACTATCAA CTAAATCGCC ATTTTTCATA GTAGACAGTA AAACATTATG CTCATAATTA 600 

ACCTTTAACG CATTAGATGT TAAATTAGAG CTTCCTATTA CCATAGAACT ATAATCCTTA 660 

TGCTCAAAAA TATATCCTTT GGCATGGAAT CCAGCAATAT CAGTTAATCT TACCTCTACA 720 

TTTTTTAATT TAAGTAATTC TCCATACATT TTAGGACTAT TAAACCCTAA GTAATTAGAT 780 

GTTAATATTT TCCCTTTAAC ACCCTTATTG CTTAAATCTA ATAGTTGAGC CTTTAAGCTG 840 

GcTAAACCGC TTTCTGTTAT AAAAGCCACA GAAAAATAAA ACGTTTCACA TTTTTGAAGT 900 

TCATCTATAA TTGTTGAAAG AACTTTTTCA TTTTTATTAT TTACTAAAAG CTTCGGTGTA 960 

TAATTCCCTT TATGAGAAAT ATGTTTGTCT ATAAACCCTT TATGTAAAGA TTGATTGAAA 1020 

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CGCTGGGGCC CAATTTAATT TATCAAGTTC GTTTATTGAC AACCATTCAA TACTCTTATG 1140 

TTCAGTTAGA GTTGGTAACT CTTTGTTTAA AGTACATTTG TATGTTGTTA ACCTAACAAT 1200 

TCCAAAATCA TATTCATGTT CTGTAGTTAT AACTTTGTCT CCAACAATTA AATCACATTT 1260 

CATTTCTTCT CTAATTTCTC TAATCAAAGC GTCTTTTTCA GTTTCATTCT TTTCAACCTT 1320 

ACnGCCAGGA AATTCCCACA TTAAAGGCAG ACTCATTTTT TCACTTCTCT GTGCACAAAG 1380 

AATTTTGTTA TCAGAAAAAA TAATAGCTCC TACTACATTG ATTACTTTTT TCATAAGACT 1440 

CACCCTTCAA TTTAAAATCA TCTTAATTGT TATTCTATCA AAAATTACAA AACTATATAT 1500 

AAATCAATAT TAAAAATTAA TATTTTACAT TCACATGAAC GCTCTACTCC ATGCATTTTC 1560 

ATACACATCT ATTATATAAT ACTTGTGAAA AGTATTGTCT TGGGGCTGTG rrrrriTACT 1620 

TTTGGGGCGT ATTTCTTTAT AATTCATTAC ATAAATGTAA GGGCTTTAGT TTTCATGTTT 1680 

TATTAAGTCT AACTGAGATT TTGAAAGGAT GTTTAGCAAC AATGGATAAA GAATTATGGA 1740 

TAGAACGAGc TAATGATAGT TTAGTTAAAC ATTTTTATGa GCAGCAATCT GACATTGAAC 1800 

AGCGAGAAGG TTTTGAAAGT AAATTAACAT TTGGTACTGC GGGTATACGC GGAAAATTCG 1860 

GTCTTGGTGA AGGTCGACTT AATAAGTTTA CTATTGAAAA ATTGGCATTA GGTTTAGCGC 1920 

GTTATTTAAA TGCCCAAACA AACAGTCCAA CAATAGTCAT TCATTATGAT ATTAGACATC 1980 

TTTCcAACTG AATTCGCCC 1999 
(2) INFORMATION FOR SEQ ID NO: 290: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1933 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 

GATGACTTTT CCCCCTCATA ATCTTCATGG TCCAGGCGTC CATTAATGCG TCAAAGGATG 60 

GCACATTTTA CCTGGAACAA ATGATTCATA TGGTTCATAA AAATCACGCG TCGTAATATA 120 

ATCTTCTAAA TCAAATGCAT AGAAAATCAT TGGCTTTTTA AATACTGCAT ATTCATATAT 180 

TAAAGATGAA TAGTCACTAA TTAATAAATC TGTTATGAAC AGTATATCAT TAACTTCTCT 240 

AAAGTCAGAA ACGTCAACAA AATATTGTTT ATGTTTGTCT GCAATATTAA GTCTATTTTT 300 

CACAAATGGA TGCATTTTAA ATAATACAAC CGCGTTATTT TTTTCGCAAT ATCTTGCTAA 360 

ACGTTCAAAA TCAATTTTGA AAAATGGGTA ATGTGCTGTA CCATGACCAC TACCTCTAAA 420 
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
X XV* A X luAl\. 


1 Vj l\f X V»wt— A 1 


aafSfTTcaTO 1 

AA)un*X IwtlU 


nnninuinvA 


X V»A\» X *\\m\9 X w 


GAACACCTGT 






X X X X Iv. ILi 1 


T« aTafY^a aa 

InnlnUUvin 


TGCTTCAGCG 


TAGAATGGAA 


^Pi r v r ?/ --i u i u i 

X t\ X N^VJTVJ XXX 


ovu 


M.v*A IVjA InUi 


InHVjWX X X i. V7 


X A X nuv InVvV? 


ATGATTTAAT 


GAATCAATAA 


ATGGTCCACC 


w Q W 


111 


tj X AUvjA X AA 


AU7V.V-AH.V- 1V» X 


TTTAAAGGCA 


CCAACGGCAT 


WV-V-n A.n\* X xvv 


790 


AATAACTTCT 


1 uAVvAAlAv X 


1 AAAA<— \ii-A C 


TGTATAAATC 


AATGGGTGAA 


AV9 X w\ 1 WvtV. 




AAaGATGTAG 


TCTGCCTTCC 


CAAGTAaATA 


TGGCAATCTA AACTTGTCOA 


X w*A X\jt-l«*AV-v» 


04U 


TCTATCTGTA 


ATATTCGCTT 


TAAAAACAGT 


GTGAATATCA 


TACTTTTTAT 


CTAAATTTTG 


AAA 


ACGTAACATT 


TCGTTATAGA 


TGTATTCAAA 






TAGAGTCTGA 


960 


TGTGAACAAC 


ACCGTATTCC 


CTTTTTTCAA 


V7XvjVtAAAAA1 


X X v_vj |\Aj inl 


TAAATATCGC 


1020 


TTTAAAAATA 


AATTGTCTTG 


TATTAAATGA 


TTGTTTGCGG 


a a a Tarrr a c 

AAAXAVX 1AL 


GTAATTCTTT 


1080 


ATATTTACGA 


ACGATATAAA 


TACTTTTAAC 


1 X^LuVinu 1 V* 


uli nwvivnn 


CATCAAGGAC 


1140 


AAATTCATTA 


ACATCGCTAG 


AAATTTCAGG 


XV* X AAVJvj X A 


InnnVAiU XXX 


TCTTCOAAAT 


1200 


GCCGCCTTTT 


CTAAATTCTT 


TTAGGTAAGT 


L. luWiA LAAu 


nftfti XV*AX X X 


TACCATTTTG 


1260 


TGTTTCTAAT 


TCGTTGTATT 


CTTCTTCTTG 


n*r^*Tv^p / *»i w i ** p 

1 XVjOL. XXX 


n , i M i M i M iv?yt # p 
nunl 1 X XVAr&X 


ATGCATCATT 


1320 


AATCACATCT GGGTTTAACT GTGCAATATA ATCAAGTTCT TGCTCATTCA CTAATAAGTA 1380 

CTTATCTTCA GGTAAGTAAT AACCATTATC TAAGATAGCT ACATTGAAAC GACAAACGAA 1440 

TTGATTCCCA TCTATTTTGA CATCATTCGC CTTCATTGTA CGTGTCTCAG TTAAATTTCT 1500 

TAATACAAAA TTACTATCTT CTAAATCTAG GTTTTCACTA TGTCCTTCAA CGAATAACTG 1560 

AACACGTTCC CAATAGATTT TATCTATATA TATCTTACTT TTAACCAACG TTAATTCATC 1620 

CTTTTCTATT TACATAATCC ATTTTAATAC TGTTTTACCC CAAGATGTAG ACAGGTCTGC 1680 

TTCAAAAGCT TCTGTAAGAT CATTAATTGT TGcAATTTCA AATTCTTGAC CTTTTAACAA 1740 

CGCTAATTTA nCTACAATAT CTGGGTATTG AATGTATAAG TCTACAACAT CTTGGAAATC 1800 

TTTTGAACCA CTTCGACTAC TACCAATCAA CGTTAGTCCT TTTTCCAATA CTAGACGTGT 1860 

ATTAACTTCT ACTGGGAACT CACTTACACC TAACAGTnCA ATGCTTCCTT CTGGTGAAAT 1920 

GTAATCGATC ATT 1933 
(2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 
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IS 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 

nGTnCGGnCA GATATATTGG TGOTCTTTAG TAAGTGTATC AAATTCATCA GATGTCAAGG 60 

GCATGTTATC ACCTCCTTAG GTTGATAACA aCATTATACa CGaAAGGAGC ATAAaCAAaT 120 

GAACACAaGA TCAGAAGOAT TGCGTATAOG CGTCCCACAA GTTTCTAGCA AAGCTGATGC 180 

TTCTTCATCC TATTTAACGG AAAAGGAACG TAACTTAGGA GCGGAAATAT TAGAACTTAT 240 

TAAAAAAAGT GATTACAGCT ACTTAGAAAT AAACAAAGTT TTCTATGCAT TAGATAGAGA 300 

ACTTCAATAC AGGGCGAATA ATAACAAACT TTAaCATTTA TCTAAAGGAG TGATAGAGAT 360 

GCCAAAAATC ATAATACCAC CAACACCAGA AAACACATAT CGAGGCGAAG AAAAATTTGT 420 

GAAAAAGTTA TACGCAACAC CTACACAAAT CCATCAATTG TTTGGAGTAT GTAGAAGTAC 480 

AGTATACAAC TGGTTGAAAT ATTACCGTGA AGATAATTTA GGTGTAGAAA ATTTATACAT 540 

TGATTATTCA GCAACGGGAA CATTGATTAA TATTTCTAAA TTAGAAGAGT ATTTGATCAG 600 

AAAGCATAAA AAATGGTATT AGGAGGATTA TCAAATGAGC GACACATATA AAAGCTACCT 660 

ATTAGCAGTG TTGTGCTTCA CGGTCTTAGC GATTGTACTC ATGCCATTGC TGTACTTCAC 720 

TACAGCATGG TCAATTGCAG GATTCGCAAG TATAGTGACA TTCATATTTT ATAAGGAATA 780 

CTTTTATGAA GAATAAAAAA ACTGCTACTT GTTGGAGCAA GTAACAGTGC AAGATGAGCA 840 

ATTGTCTTAA ATAATTATAT AAGGAGTTAT TAATATGACC TTACAACAAA AAATACTATC 900 

ACATTTTGCA ACATATGACA ATTTCAATTC TGATGATGTT GTTGAAACTT TTGGGATATC 960 

TAAAACACAT GCAAAATCCA CACTTTCAAA ACTTAAGAAA AAAGGAAAGA TTGCAATGGA 1020 

AAGTTGGGGT GTCTGGCGTG TTATTGAATC GCAATTGCAT TTAAGTGTAG TCGAACGTAA 1080 

AAAAGAAATT TTAGAAGAAC AATTTGAATT GTTAGCAAGA TTAAATGAAC AAAGTGATGA 1140 

CCCTAGAGAA ATAGAAGAAC GTATCAAGTT AATGATTCGT CTAGCTAACC AATTTTAAGG 1200 

AGGAGTTAAT CAATGGCAGT ATTAGAAGGT ATTTTTGAAG AATTAAAACT ATTAAATAAG 1260 

AACTTACGTG TGTTAAACAC TGAACTATCA ACTGTAGATT CATCAATTGT ACAAGAGAAA 1320 

GTTAAAGAAG CACCAATGCC AAAAGAAGAA ACAGCTCAAC TGGAATCAAT TGAAGAAGTT 1380 

AAGGAAACTT CTGCTGATTT GACTAAAGAT TATGTTTTAT CAGTAGGAAA AGAGTTCCTT 1440 

AAAAAAGCAG ACACTTCTGA TAAGAAAGAA TTTAGAAATA AACTTAACGA ACTTGGTGCG 1500 

GATAAGCTAT CTACTATCAA AGAAGAGCAT TATOAAAAAA TTGTTGATTT TATGAATGCG 1560 

AGAATAAATG CATGAAGCTA GATCACTCAA ATAGAGCTCA TGCAAAGCTT AGTGCAAGTG 1620 

GAGCAAAACA ATGGCTAAAC TGTCCACCGA GTATTAAGGC AAGTGAAGGT ATTGCAGATA 1680 

EP0 786 519 A2 



GTCTTAAATA TGAAGGCCTA ACACAGTTTG AGTTTAATAA AGCTTTTCAA AATTATAAGC 1800 

GAAATCAATA TTACAGTGAA GAGTTGCGCO AATATGTTGa AGAGTACGTA GCTAATGTAG 1860 

AAGAAAAGTA TAACGAGGCT TTGaGTAGAG ATGACGATGT AATAGCTTTA TTTGAAACAA 1920 

AATTGGATyT AGGTAAATAC GTCCCTGAAT CTTTTGGTAy TCGTGATGTC AtTATATTTT 1980 

CAGGTGGTGT ACTTGAAATT ATTGACCTTA AATACGGTAA AGGCATTGAA GTTTCAGCTA 2040 

TAGATAATC 2049 
(2) INFORMATION FOR SEQ ID NO: 292: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 942 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 

ATGATGTTTC TATATTCGTA TTAGGAAAAC CTGTTGGTAT TACAACAAAC GCCCTAAAAT 60 

TACAATACCG CTGCTCCTAT ACCAATTGCA ACAACAGTTT TAACTGAAAT ATCTTGTTTT 120 

TTCATCTTCA TTACTCCTtt ACATAAAAAA TTCATTATAT TGATGGTGCT TTAGATAAAT 180 

GAATCGTCCA ATCATTTCCA GTACCAATAT GATATAAATC TGAAAATGAG TCTTGATTGA 240 

CTGCTACACC AATATTTACT AGCGAGTTAA CATACACAAG AGGTTCACCC ACATTAACAT 300 

CTGCAAACGA TCGCGCAAAT TTAATAATAT TTTGATAGAC TTTCTTATCT TGATGATAAA 360 

TTGTTACCAC CAAATTATTA CCATGAACAA TTTCCAAGGA TTTTAAGAAT GCCAATGGAA 420 

TATTTGTCCA TAATGACCCA AATCTGATAT CTAAAATATC AATGCTTCCT GTAACAGAAT 480 

CCT<»TTTTT TGTCACTTCT CTTATTTCTA ATGCCTCAAT ACTATCAACA TTAATTGCCT 540 

GACCGAGACG TTCAAACGCT ATCTTATTTG CAGCTAATCT CGCACCATTG TATGCATAAA 600 

CATCTCTACC ATGAAAAGTA TGACTTTCTT CCGAATGAGG CAATCGGCTT TTCACTTCAT 660 

CAATTTCGAT AACTTTTTTA ATACCTTCGT AATGTTTAAT ATGACTTAAA GAACCATTAT 720 

CAGGTGTAAT AATGTAATGA CCTGAATATG TTAAGCAAGC AATGTCCGCC TATCACTACC 780 

TACACCCGGG TCTACCACTG ATACAAAraC TGTGCCTTTA GGCCAGTATT TTACAGTTTG 840 

ATATAAACGA TATGACGCTA CCCAAATGTC ATACGGTGGT ATATCATGCG TTAAGTTTTC 900 

AACACGTATA TCATCATTAC AGTATATGCA ACTCCATACA TT 942 
(2) INFORMATION FOR SEQ ID NO: 293: 

(A) LENGTH: 1268 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293: 

TGTAATAAAA TTTTATGnAA CATGCTGnGA TGCTACCATG AACCTTCTTC ATTTCTTTAT 60 

GTGAGATTGT GaAATTAATC AAATAATAAT ACGGTGGATA CTTTCCTAAT TTACGATATT 120 

CCATTTCCTG ACGATAAAAT GTTAAATAAT catttttttg AACATCCAAT ATTGAATAAT 180 

GATCTGGATT ATACGTTTGA ATGATGACTT GACCTGCCTT TTCATGACGA CCAGCTCTAC 240 

CAGCCACTTG CGTTAATAGT TGATAAGTAC GTTCGCTCGC CCGAAAATCA GGTAAATTTA 300 

ACATTGTATC TGCATTCAGC ACACCAACTA AAGTAATATT TGGATAATCT AATCCTTTCG 360 

CAATCATCTG AGTACCTAGT AAAATGTCAC CGTTACCTTT TTCGAATTCA GTCAATAACT 420 

TTTCATGTGC ACCTTTCTTT GAGGTTGTAT CTACATCCAT CCTAATTATG CGCGCATCTT 480 

CAAATTCTTG TTGCAATAGT TCTTCAACTT TCTGAGTACC AGTACCTACT TGTCGAATGT 540 

GTTCACTCTC ACAATTTGGA CATTGATTCG GTGGCGTCTC TTGGTAACCA CAATAGTGAC 600 

ATTTTAATAA GTCTGTCGTT TTATGATACG TTAATGAAAT ATCACAGTTT GGACATTGCG 660 

GTACATATCC ACAATCCCGA CATAACATAA ACGATGCATA ACCACGTCGA TTTAAAAATA 720 

AAACAACTTG TTCCTGTCGA TCTAATCTTA ATTGTATGGC TTCACGTAAA TCTTTTGAAA 780 

ACATTGACCG ATTACCTTCA CTCAATTCTT CACGCATGTC TACTATATCA ATTTCAGGTA 840 

AAGCTTGTTG GTTCACTCTG TTTGGTAATG ATAGCAAATG ATAAACGCCT TTTTCAGCTC 900 

GTGCATAACT TTCAAGACAT GGTGTTGCAC TTCCTAAAAT GACTGGACAG TGATGATATT 960 

CAcTOCGCCA TTGGGCAATT TCTCTAGCGT GATATCTCGG ATAATCTTCT TGTTTATATG 1020 

TAGATTCATG TTCTTCATCA ATGATGATTA ACCCTAAATT TTTGAAAGGT GCGAACACAC 1080 

TTGACCTTGC ACCAACACTT ACTCTCGCAC GACCATCCCT AATTTTTTGC CATTCATCAT 1140 

AACGTTCCCC ATTAGATAAG CCAGAATGTA ATACAGCAAC GTCATCACCA AATCGACGTT 1200 

TGAAGCGTAA AACCATTTGC GGTGTTAGAG CGATTTCAGG AACTAACATC ATCGCCTGTT 1260 

TTCCTTGG 1268 
(2) INFORMATION FOR SEQ ID NO: 294: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 629 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294: 

TACCACCAAA TAATATATTA GCTGGCATTT TAATAACATT TAnATTTGTC ATGATATCAT 60 

CAATAAAATG TTGAAACTTC GTAATTTCAC CTTCATAATC ATCAATTGCT GCTAATTGCG 120 

CACTCGATGC TTGCTGATCT AAATTTAAAA TATTCGACAT GCGTTGACTA TAATAAACTA 180 

AATGTTCTAT TAAGCCATCG TCACTCTTTT CCTTTGGTGC TGACATGACA GCGATACGTT 240 

TCAAAGGATA GTGTTGCGCC AATTTTAATG TCATTAATCC ACCTAAAGAC ACACCCGTTG 300 

CACTGATAGA TTCATAACCT TCATTGACTA AAAATTGGTA AGCTTTCTCA ACTTCTTCCC 360 

ACCAATCATC TACATTATAT GTCATGAAAT CTTTCAACAA TAAACCATGA CCTGGATAAT 420 

TCGGTGCATA ACAACTAAAT CCTTGGTCAT TTAACTCAGC TGCAAGATGC TTCACATCCC 480 

GATTTGTACC TGTAAATGAA TGTAATAATA ATATCGCATG TCCATTTGTG CCTTTTAAAT 540 

ACGtGGACTC GGTGTTTTAA TTCTCATTTT TCtaTATAcC TCCACTAtGT CTAAAGakGT 600 

TkGCTAAACG CGTTGtCGTC GATGATTAA 629 

(2) INFORMATION FOR SEQ ID NO: 295:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2817 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 



TATGAAAGTA ATGAATGGTA ATATTATTAA ATTTGATGGA AAAGTAGATA TTGATAATGC 60

AGATAATATC GGTTTTTTAA TTGAGCATCC TAAATTATAT GATAATAAAT CAGGATTGTA 120

TAACTTGAAA TTATTTGCAC AAGTATTAGG TAAGGGTTTT GATAAAGCAT ACACAGACAA 180

AATTATAGAT GCATTTGGTA TGAGACCTTA TATTAAAAAG AAAGTTAAGA AATATTCAAT 240

GGGGATGAAG CAAAAGTTAG CAATTGCAGT ATCTTTArTG AATAAACCTA AATTTTTAAT 300

CTTGGATGAG CCTACAAATG GkATGGATCC AGATGGCTCa ATTGATGTGC TGACTACAAT 360

TAAGTCTTTA GkAAATGaAC TTGATATGAG AATTCtAATA TCAAGTCATA AGTTAGAAGA 420

TATTGAATTA ATTTGTGATA GAGCTGTATT TTTAAGAGAC GGnCATTTTG TTCAAGATGT 480

AAACATGGAG GAAGGTGTTG CATCTGACAC AACGATAGTT ACTGTTGATC ATAAAGACTT 540

TGATAGAACT GAAAAATATC TTGCAGAGCA TTTCCAATTA CAAAATGTCG ACAAAGCAGA 600

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ATTAGATATT TATCCOAAAT ATATTOAAAC ACGTAAAAGT TCATTGCGTG ATACGTACTT 720 

CAATATAAAT CAAAGAGGTG ATAAATAATG AGAATTTTAA ATTTAGTTAA GTATGATTTT 780 

TATAGTATAT TTAAaArTCC TTTAACATAT TTAGCGaTAC TAGTCGTATC TAGTTTGATT 840 

GCAACTCAAA GTATACTTAT GGCAAATTCG ATGGATAACC CGAAACATAT TATTGTCTAT 900 

GGATCTGTAT TTGCTGCAGC AAAATGGTTA TTGTTAATAA TTGGATTAAT GTTTGTTGTT 960 

AAGACAATTA CGCGTGATTT TTCACAAGGT ACAaTTCAAC TATATATGAG TAAAGTTAAA 1020 

ACACGCGTTG GAtACATTAT TTCGAAAACA ATTTCAATTa TTTTAATTTC AATATTATTT 1080 

GCATTAATTC ATTATGTGAT TTTGATTGTT GTGCAGGCAT CTAGTAATGG AAAAAATTTG 1140

GCGTTTTCTA AATATGTAGA TAATTTATGG TTCTTCCTAA TCTTTTTACT ATTCTTTGGC 1200 

TTGTTTTTAT TCTTAATCAC ACTTGCATCA CAAAAAACAG CAATGATATT TTCATTAGGT 1260

GTATTTTTAG TACTCATTGT ACCGTTTATT AAACCTTTTA TTACATTTAT CCCAAGATAC 1320

GGTGAAAAAG TTTTAGATGC TTTTGATTAT ATCCCTTTTG CTTACTTAAC TGATAAAATG 1380 

ATTAGCTCTA ACTTTGATTT TAGCAATTGG CAATGGGTAA TTTCATTAGG TTCTATAGTG 1440 

ATATTCTTCA TTTTGAATAT CTTATATGTC GCTAAAAAAG ACATyTAATA AAAATAATTT 1500 

TGAGGTTGGG AATTTTAAAT TTTCCCAACC TCAAAGTTTG TCTTATTGTA AATTTATTTA 1560 

TTTTCTAATT TATTTAGGAT GGAATTATAA ACTGCTTTCC AAAATGAAGC GTCAGTTTTA 1620 

TAGCGGTTTG ATATAACTAA GTGTGTTTCT TTTTCTAAAT CTGCATAGTC TGGATGATCT 1680 

TTGCTCGGTA ATTTATCAGC ACGAACATCA GTTACAAATT TTTGGACTTC ATTTGCTCTT 1740

GGTCCCCAAA CTGTTTCTTG TTCGAATTGA TCATTCAAGA ATACGAAGAT AGGAATTGCA 1800 

CGTGATTTAC CATTTGTTAA ATATTGATCG ATCAGTTTTG TATCATCATC TCTATGGAAC 1860 

ACGCGTACTT CTAAATTTAA TGCTTCACTG ATGTGTTTTA GAATTGGGAG ATTCATCATT 1920 

GCATCTCCAC ACCAGTCTTC AGTAATTACT AATACTTTAG AATAATTCAT CTCTTTTATT 1980

TTTTTGATGC GTGAATCATC TTCTGGTAAC TCAAATGATT GATAGATACT GAGAACGGTA 2040

TCTTGATTTG TCTTCATTCC ATCAATGTAT TCATTTAAGG GTTGGCTATT TTTGAAATAA 2100 

GTTTCTAAAT TTGTCATTGT AAAAACCTCC TTTAGCATTT ACAACATTAT ACCAATTTAT 2160 

AGTAATAAAA GGTAATGAAA TAAATTAATT GCAAATTCTT TGTTAATTTT TGTTAAGGAT 2220 

GAAACGGGAA GCACCTTATG CTATATTTAA ATAAGTACAA AGAAAGGGTG ACATCAGTGC 2280 

GTATTCAAAA TCGCTGGGTT GTGTTTATAT TATTTTTAAT CTGTTCTTTT GGTGTATTAA 2340 

TTGGTTTATA TCAATATCGT CATACGAAAA CTGTGGATTT GTCTAATCTT GAAATAAACG 2400 
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TTGATCGTTT 


TAAATTTTAT AACAGTAAAG CTCACCCTGA TCTTACCGTT AAAGTGAGAG 2520

AAAAGGATAA CATCGTTAAG GGGATAATAT TAGTAAGAGA TGAAAAGATA CATACTAATT 2580

TTGAtGGGGG AATTGGTTCG CCGATAAATA ACGCgATTGA AAATCTTGGa TTCgGATATA 2640

AAAGrACaAA AGTTGGcAAT GrtTkCtCAT CgGTAAAGTA TATTGATAGA GATAACCATT 2700

TAAAATTAAA CTTACTTTAT CAAGATTTAG AAATTAAACG TATTGAATTT TTTAGTAAAT 2760

AGCTTTAGGT CTTAAAGTTw TAAAAAACGA ATGAaTAATT TTATTGGGAT GAGTGAC 2817




(2) INFORMATION FOR SEQ ID NO: 296: 










(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296:








TCTGTTAAAA TGATTTTTCT TTTAnAAAGG CCGnAAATCA ATGTTCGATT nTTATTTGCA 60

TTATGGTCTC GATATTGGTA GAATATCAAA TGGTTAAATG AGAAAAACTT GGAGGTGCTC 120

ACATGTCATC AATCGTAGTA GTTGGGACAC AATGGGGAGA CGAAGGAAAA GGAAAAATAA 180

OGGATTTCTT GGcAGAACAG TCAGATGTTA TCGCGCGTTT TTCAGGTGGT AATAATGCAG 240

GCCATACCAT TCAATTTGGC GGAGAAACAT ATAAATTACA TTTAGTACCA TCTGGTATCT 300

TTTACAAAGA CAAATTAGCG GTAATCGGTA ACGGAGTCGT TGTTGATCCA GTTGCACTAT 360

TGAAAGAATT AGACGGATTA AATGAACGTG GCATTCCTAC AAGTAATTTA CGTATATCTA 420

ATCGTGCGCA AGTGATTTTA CCATATCACT TAGCACAAGA TGAATATGAA GAACGTTTAC 480

gTGGtGACAA TAAGATTGGT ACAACTAAAA AAGGTATCGG TCCAGCATAT GTAGACAAAG 540

TTCAACGTAT CGGTATTCGT ATGGCAGATT TACTTGAAAA AGAAACATTC GAAAGATTAT 600

TAAAATCAAA CATTGAATAT AAACAAGCAT ATTTCAAAGG TATGTTTAAC GAAACATGTC 660

CATCATTTGA TGATATCTTT GAAGAATATT ATGCAGCAGG TCAACGTCTA AAAGAATTTG 720

TAACAGACAC ATCAAAAATC TTAGACGATG CATTTGTAGC AGATGAAAAG GTACTTTTCG 780

AAGGTGCGCA AGGTGTAATG TTAGATATCG ACCATGGTAC ATATCCATTC GTTACATCAA 840

GTAATCCAAT TGCAGGTAAC GTTACTGTTG GTACAGGTGT AGGTCCTACA TTCGTTTCAA 900

AGGTAATTGG TGTATGTAAA GCTTATACAT CACGTGTTGG TGATGGTCCA TTCCCTACTG 960

AATTATTCGA TGAAGATGGA CATCATATTA GAGAGGTTGG TCGTGAATAC GGTACAACAA 1020

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TAAGTGGTAT TACAOATTTA TCTATTAACT CAATCGATGT TTTAACAGGC CTAGACACAG 1140 

TGAAAATCTG TACAGCTTAT QAATTAQACG GTAAAOAAAT TACTOAOTAC CCAGCAAACT 1200 

TAGATCAATT AAAACGTTGT AAACCAATCT TTGAAGAGTT ACCAGOTTGO ACAGAAGACG 1260

TAACAAATGT GCGTACTTTA GAAGAATTAC CTGAAAATGC ACGTAAATAT TTAGAGCGTA 1320 

TTTCAGAATT ATGTAATGTA CAAATTTCTA TCTTCTCAGT TGGTCCAGAT AGAGAACAAA 1380 


CAAACCTATT AAAAGAATTG TGGTAGAACT TTATATAAGT CATACACAAT GATTATAAAT 1440 

ACATGAGCCT TCTATCTTTA TTGGTAGGAG GCTTTTGTTA TGCTTGCTTC TGTATCGATT 1500 

CGATTATTTA GATAAAAAAT ACTAACGTAA AGGCGATATT TGCTAGTCAT AATTTAGAAG 1560 


rTTAgATGAt AtTtAACGAA AAtTAAgATG anATACtTGA ATGGTAA 1607 
(2) INFORMATION FOR SEQ ID NO: 297:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3055 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: 





TTAGAAGTAA GCACTTTAAT ATCTTTACCT ACCCATGTAC CAACACCTTC TTTAGGCTTT 60

GGATTTTCAG CATGGTTATT TGATTTATTC ACCTGTTCAG ATCTATTTAC TTTATTACCT 120

TTATTCGGGT TCTCTAATAC ATCAAATTTT AATCTCGGTG AATAAAAAAG ATATATTAAA 180

AATCCAACTA AAAATAATAC ACCGACGACT CTTATAATTA ACTTTTTCAT CAATCAACCA 240

CCTAAAAAGT ATTAATACTA TTGTAAAAAA CAACACATTA ATTAGCAAAT TTTCAACACT 300

GACATAACTG TGTCGTTTCG ATAAAACAAA ATCATCTTCA GGCATAATTT TAACGTCATA 360

AAGATTTTCC CTACACTCTA TATCATAACC TATCTTTATG TTTTCAGGTT GAATTTCATT 420

ATTAAGATTA AAATATGTAT AAAAAAATGG ACAGTTAAGG TATCAAATGA ATACCATCAA 480

CTGTCCGACT ATTCTTCATC AAAAAACCTG ATAAAACAAA TTGCCTTATC AGATTAGTAT 540

CATTTGTATA AGCATATTAA TGACCAAATG TTGCTTTAAT CAGTGATGTC GTTTCTCCAC 600

CTGGATATAA TACATATAAT AATAAGTAAA CTGCTACACC TGTAATTGCA GTACAGAACC 660

ATATAACTGA AGCGAATGGC CCGAATTTGC GGTGTACATT ATATTTATCT TTAAATGCAG 720

TAATAATTTG AACTAGGCCT AGAATACCAC CAATTGTTGC TAAATTAATG TGGAAAAATA 780

AGAAAATCGT ATAATATTTC TTAATTGATG CTGGTCCGCC AAATGCTGTA TTACCGATAA 840

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TAACATTTTT GTGTTTATTT ATTTCCCTTT TCCAAATAAG TCTCCAACCA ATGGCAATTA 960 

AAATTGCACT AATOACAATA CATGTCGTAC TAATCGTTGG TAAAATTGGA ACGCCCATAT 1020 


TTTTCATCCT AACTTAATTA ATCTAGATCA AAGTAAGTAA TGAAACAATC ACAGCTAACA 1080 

CGAAAAAGAT CACTAAATAA TTTAGTGAAT ATATAAACAT TTGTGTTGCC CATTTTGTTT 1140 

GATCTGAATT TTTCTTAAAT GTTGTTAAAC CTAATGCAAT CCATCCTAAA TTTAATAAGG 1200 


TrGCTAACAC TaCGAATACG ACACCTAAAT TTATTAGTAA TAAAGGTACT GGCAATAAAA 1260 

TAATCAACCA GATAAACATA CTGACACGTG TACGTTTAAA GCCCTTAACT GATGGTAACA 1320 

TTGGAATATT TGCAAGTGCA TATTCATCTT TACGTTTAAT AGCTAAGGcA TAAAAATGAA 1380

TTOOtTGCCA ACmAAATACA ACTAAAAACA GCGCAATCGC TGTTAAACTA ATTTGTCCTT 1440 

CAATTGCAAC CCATCCAATT AGTGGTGGTA CTGCTCCAGG AAAACTCCCA ATCACTGTGT 1500 

TCCATGTTGT ATGTCTTTTA GACCATATTG AGTAATAAGA CACATAACCT ACAATCCCCA 1560

TAAGACCAAG TACGCCTGAT GGTATATTCA ATAAAAACAA ACAAATTTCT CCAACTAACA 1620 

TCATACCAAA ACTTAATAGT AATAAATTTT GATCTGTAAT TCTATTATTT ACAGTTGGTC 1680 

TATTTTGTTT ACTAGGCATA ATACGATCAA TATCTTGGTC GTAATAATTA TTTAAOGCAC 1740

ATGCGCCACC CATAATTAAA GTAGATCCAA ATAGCATTAA TAAAATTTGA GGTATTGATG 1800 

ATAAGAAGGA ATGATTTGTC ATTACAACTG CTAGCCATGC GCcCGCAAAA GCCGGAATTA 1860

AGTTACCTTG AACAAGTCCC ATTTTAATTA TCTGTTGCAA TTCTTTGAAG TTAACTCTGC 1920 

TAATATTTTG TGACmAAGTA TGCTCTTTGC TCATAATCCC CCTCCTTAAA TTTGTTCATA 1980 

TAAGATTATG ATATCTTAGA TTGCATAAAA AGACTAGGTT TAATAAAATT AAATTGTGAC 2040 


AAATTAACGA CAAGAGAAAA TGTCAATTTT GTGACACAAA TAACATTTAA TTTATTGCTA 2100 

TAATGTATAT GTTAGAAAAT TTTAATAAGT AGAATCATGC ATCTAAAAGA GATTAATATT 2160 

TAAGCTTCAA ATTTGAGTAA ACGTGGATTA CATAATTATC CCAATAAAAA AATCATTACG 2220 


ATTAAGTTCT TTTTATGTCG TCCACATACA ATACTTGTAA AATTAAATCA TATTTCCTGC 2280 

GTTGGATCCC ATCTTTTCAT ATCCTACAAT CAGGTCTATT TATAGTATCA TCTCAAATCC 2340

GGCTATTAAT TCTAATTCTC AGTGATGCGT TTTTTTATGA TGGGGTGTAT AAATTGTTTG 2400

GCAAAAAGAA TTTAAAATGG TTAGGTGTCG TAGCAACGTT AATGATGACA TTTGTACAAC 2460 

TTGGTGGAGC CTTAGTTACC AAAACCGGAT CAGCTGATGG TTOTGGTTCT TCTTGGCCAC 2520 

TATGTCATGG TGCGTTGATT CCAGAATTCT TTCCTATTGA TACGATTATT OAGTTAAGTC 2580

ATAGAGCCGT TTCAGCTTTG TCTTTATTAA TGGTCTTATG GTTAGTTATC ACTGCATGGA 2640 
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TATTGCAAGC ATTAATCGGA GCTGCTGCTG TTATTTGGCA ACAAAACGAT TACGTTTTAG 2760 

cATTGCACTT TGGTATATCA TTAATCAGTT TCTCATCTGT ATTTTTAATA ACATTGATTA 2820 

TTTTCTCTAT AGATCAAAAA TATGAAGCTG ACGAATTATA TATCAAAAAG CCATTAAGAC 2880

GTTTAACATG GTTAATGGCA ATCATCATTT ATTGTGGTGT TTATACTGGT GCACTAGTGa 2940 

GACATGCGGA TGCAAGTTTA GCATATGGTG GTTGGCCATT GCCATTACCA CGATCTTGTA 3000 

CCACATTCAG GAACAAGATT GGGTTCAACT CACGCATCGT ATCAnGGTCn nTTAA 3055

(2) INFORMATION FOR SEQ ID NO: 298:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 748 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298:





TTCTTATTTA AAGAAGTCAT TTTTAGAAAT TGTTGAGACT TTAAAAAATG ATCCGTATAA 60

AATAACACAA TCTTTTGAAA AATTAGAGCC TAAATATTTA GAGCGATATT CAAGAAGAAT 120

TAACCATCAG CACAGGGTCG TCTATACCGT AGATGATCGA AATAAAGAAG TATTAATACT 180

ATCGGCATGG TCACATTATG ATTAATGAAT ATTCAATATC TGAATAACTT TAATOATAAG 240

TTAATTAAAG AAACTAGTAT TTAAGTGTAG GGAAAATAGC GACGTTAATG CGTTGTTATC 300

TCTACACTTT TTAATTTTAT AATAGCGCAA GACTAAACAG ATTGAAATTA GTAACAATAA 360

AAGAATAACG TATTATAATA AGGAATTTTA AATTGTGACT TTTTCGGAAT ATTAAATTTT 420

AGAAATATGA GGTTTTTAAG CGGATTCCTC ACAAAATTTT AAAAATATTT AAGCCTGAAA 480

ATGATAAAGC GGTAGGGAAC GTTTTTCTGA AaGTTAGTGA TACAATAGTT TTAAGTTGAA 540

ATACAGGAGG ATGAATAACA TGAATCAGTC AGTCAAATTA CTTAAACATT TAACAGATGT 600

AAACGGCATT GCTGGTTATG AAATGCAAGT TAAAGAAGCA ATGCGTaACT ATATAGAGCC 660

TGTCAGTGAT CaAATTATTG AAGATAACTT GGGTGGCATT TTTGGAAAGA AAAATGCTGA 720

GAATGGTCAA TACTCAATTA TGGAnTTC 748



(2) INFORMATION FOR SEQ ID NO: 299: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 






EP0 786 519 A2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299:

ACCTCCGAAT AATTGATTCC ATTAACTTTT TTTGTGaAAA AtTTAAACAG GCGAAGTCTT 60 


CAATAAGTGA AGATGTCCAA ATTATAAAAA ATACATTCCA AAAAGAAAAG TTAGGTACAG 120 

TAATTACtAC ycTGGCGCAA GTGGTGGTGT TACGTATAAA CCAATGATGA GTAAAGAAGA 180 

GGCGACTGAA GTTGTTAATG AGGTCATTAC TCTATTAGAA GAGAAAGAAC GTTTGTTAcC 240 


TGGCGGATAT TTATTTTTAT CAGATTTGGT AGGTAATCCA TCGCTACTAA ACAAAGTTGG 300 

TAAGTTAATT GCCAGTATTT ACATGGAAGA AAAATTAGAT GCTGTTGTTA CCATTGCGAC 360 

AAAAGGTATT TCATTGGCAA ATGCGGTTGC TAATATTTTA AATTTACCAG TAGTAGTGAT 420

TAGAAAAGAC AACAAGGTGA CTGAAGGTTC TACAGTTTCA ATTAATTACG TTTCAGGATC 480 

TTCAAOAAAA ATAGAAACAA TGGTACTTTC GAAGAGAACT TTAGCAGAAA ATTCAAATGT 540 

TTTAGTTGTC GATGATTTTA TGAGGGCTGG TGGCTCTATT AATGGTGTTA TGAATTTAAT 600

GAATGAGTTT AAAGCCCATG TAAAAGGGGT ATCAGTACTT GTAGAATCAA AAGAAGTTAA 660 

ACAAAGATTG ATTGAAGATT ATACTTCCTT AGTGAAATTA TCTGATGTAG ATGAATATAA 720 

TCaAGAGTTT AACGTAGAAC CTGGCAACAG TTTATCTAAG TTTTCATAAA AGGAGTTTTA 780

GTATTATGAA AATCATTAAC ACAACAAOAT TACCGGAAGC ACTTGGACCA TATTCGCATG 840 

CAACAGTTGT GAATGGTATG GTTTATACTT CTGGTCAGAT TCCATTGAAT ATTGATGGAC 900 


ATATCGTAAG CGCTGATGTT CAAGCACAGA CAAAACAAGT TTTAGAAAAT TTAAAGGTTG 960 

TTTTGGAAGA AGCAGGATCT GATTTGAATT CTGTTGCGAA AGCGACCATT TTCATTAAAG 1020 

ATATGAATGA TTTCCAAAAA ATAAATGAAG TGTATGGTCA ATATTTTAAT GAACACAAGC 1080 


CAGCGCGTAG TTGTGTAGAG GTTGCGCGTT TGCCAAAAGA TGTGAAAGTA GAAATTGAAT 1140 

TAGTAAGTAA AATTAAGGAA TTATAATTTT CGATTAATAT GTTTAATCAA GCTTCTAAAT 1200 

AAAACAGAGA GATATATACT ATAGGGGGGC TCACTACATG AAAGTGACAG ATGTAAGACT 1260 

TAGAAAAATA CAAACAGATG GACGAATGAA AGCACTCGTT TCCATTACAT TAGATGAAGC 1320

TTTCGTAATT CATGATTTAC GTGTAATTGA AGGAAACTCT GGCTTGTTCG TTGCAATGCC 1380 

AAGTAAACGT ACACCAGATG GTGAATTCCG CGACATCGcg CATCCTATTA ATTCAGATAT 1440

GAGACAAGAA ATTCAAGATG CAGTGATGAA AGTATATGAT GAAACAGATG AAGTAGTACC 1500 

AGATAAAAAC GCTACATCAG AAGATTCAGA AGAAGCTTAA TCAATTTTAT ATTTAGCGAT 1560 

GTAATACATT TGCAATAAGT TGATTTGATA CTGTCGATAA AGCATAAAGC TTTGTCGGCA 1620

GTTTTTTTAG TTTGTATTAA TGTTTTTTTA TTTTTAATGA AAGGCTAATA AATATATACG 1680 
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TGaTGCTCGT 


ATTTTTGAAG TAAGAAAAAA G'lTtmTTTA AAATTACAAC GAATTAAAAA 1800

CAATGCCTTT TATATGTTGA AAGAGTATTG CaGATTAAAT TaTAATAATG ACGAaGgTAA 1860

AATTTAATGG GGGTTAATGT TCATGCOAAG ACACGCGATA ATTTTGGCAG CAGGTAAAGG 1920

CACAAGAATG AAATCTAAAA AGTATAAAGT GCTACACGAG GTTGCTGGGA AACCTATGGT 1980

CGAACATGTA TTGGAAAGTG TGAAAGGCTC TGGTGTCGAT CAAGTTGTAA CCATCGTAGG 2040

ACATGGTGCT GAAAGTGTAA AAGGACATTT AGGCGAGCGT TCTTTATACA GTTTTCAAGA 2100

GGAACAACTC GGTACTGCGC ATGCatGCAA ATGGCGAAAT CACACTTAGA AGACAAGGAA 2160

GGTACGACAA TCGTTGTATG TGGTGACACA CCGCTCATCA CAAAGGAAAC ATTAGTAACA 2220

TTGATTGCGC ATCACGAGGA TGCTAATGCT CAAGCAACTG TATTATCTGC ATCGATTCAA 2280

CAACCATATO GATACGGAAG AATCGTTCGA AATGCGTCAG GTCGTTTAGA ACGCATAGTT 2340

GAAGAGAAAG ATGCAACGCA AGCTGAAAAG GATATTAATG AAATTAGTTC AGGTATTTTT 2400

GCGTTTAATA ATAAAACGTT GTTTGAAAAA TTAACACAAG TGAAAAATGA TAATGCGCAA 2460

GGTGAATATT ACCTCCCTGa TGTATTGTCG TTAATTTTAA ATGATGGCGG CATCGTAGAA 2520

GTCTATCGTA CCAATGATGT TGAAGAAATC ATGGGTGTAA ATGATCGTGT AATGCTTAGT 2580

CAGGCTGAGA AGGCOATGCA ACGTCGTACG AATCATTATC ACATGCTAAA TGGTGTGACA 2640

ATCATCGATC CTGACAGCAC TTATATTGGT CCAGACGTTA CAATTGGTAG TGATACAGTC 2700

ATTGAACCAG GCGTACGAAT TAATGGTCGT ACAGAAATTG GCGAAGATGT TGTTATTGGT 2760

CAGTACTCTG AAATTAACAA TAGTACGATT GAAAATGGTG CATGTATTCA ACAGTCTGTT 2820

GTTAATGATG CTAGCGTAGG AGCGAATACT AAGGTCGGAC CGTTTGCGCA ATTGAGACCA 2880

GGCGCGCAAT TAGGTGCAGA TGTTAAGGTT GGAAATTTTG TAGAAATTAA AAAAGCAGAT 2940

CTTMAGATG GTGCCAAGGT TTCACATTTA AGTTATATTG GCGATGCTGT AATTGGCGAA 3000

CGTACTAATA TTGGTTGCGG AACGATTACA GTTAACTATG ATGGTGAAAA TAAATTTAAA 3060

AcTATCGTCG GCAAAGATTC ATTTGTAGGT TGCAATGTTA ATTTAGTAGC ACCTGTAACA 3120

ATGTATTGGT GGCAGCTGGT TCCACAATCA ACCAAATGAC 3180

AGTTTAGCTG TGGCAAGAGC AAGACAAACA ACAAAAGAAG GATATAGGAA ATAATCATTT 3240

ACGTATTTAA AATGGCTAGG ATAAAAGGAT AATCCTATGT AATATTAATG TAATCTTTAT 3300

GATTTAATGA TTCGCATAGT AATGGAGTTA CATyTTATAT ATAATAGTAA TTGCGTAAGT 3360

AAATAATTGG AGGACTATAA ATGTTAAATA ATGAATATAA GAATTCGTCA TTAAAGATTT 3420

TTTCATTGAA AGGAAACGAA GCATTAGCGC AAGAAGTTGC TGACCAAGTA GGAATTGAAC 3480

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GTATTCGTGG TTGTGACGTA TTTATTATTC AACCAACATC ATATCCTGTO AATCTACATT 3600 

TAATGGAATT ATTAATTATG ATTQATGCTT GTAAACGTGC TTCTGCAGCA ACAATCAATA 3660 


TTGTAGTGCC ATATTATGGA TATGCAAGAC AAGATAGAAA AGCCCGTAGC CGTGAGCCAA 3720 

TCACTGCTAA ATTAGTTGCA AACTTAATCG AAAcAGctGG CGCAACTCGT ATGATTGCGT 3780 

TAGACTTACA TGCACCACAA ATTCAAGGAT TCTTTGATAT TCCAATTGAC CACTTAATGG 3840 

GTGTGCCAAT TCTTGCTAAA CATTTCAAAG ATGATCCGAA TATTAACCCA GAAGAATGTG 3900

TCGTTGTTcA CCAGACCATG GCGGsnTTAC ACGTGCACGT AAATTAGCTG ACATTTTAAA 3960

AACTCCAATT GCAATTATAG ATAAACGTCG TCCTAGACCA AATGTTGCTG AAGTGATGAA 4020

CATTGTTGGT GAGATTGAAG GACGTACGGC AATTATTATT GACGATATTA TTGATACAGC 4080 

AGGTACAATC ACTTTAGCTG CACAAGCATT AAAAGATAAA GGTGCTAAAG AAGTATATGC 4140 

TTGTTGTACA CACCCTGTTT TATCAGGACC GGCTAAAGAA CGTATCGAAA ATTCTGCTAT 4200

AAAAGAATTA ATCGTAACAA ACTCAATTCA TTTAGATGAA GATOGGAAAC CATCTAACAC 4260 

TAAAGAATTA TCTGTTGCTG GTTTAATCGC ACAAGCTATC ATTCGTGTAT ACGAAAGAGA 4320 

ATCAGTTAGC GTATTATTTG ACTAATATTT AAAAGGCGTT TGACGAACAT ATTCCAAACG 4380

TGTATAATAG TTTCGTTCGT GATTATACGA ATAAATAAAC ACTTGCAAGC AACGATGATG 4440 

TTGATGGGTA AGTGAGGTGC TCGTTTTGAG CAAAAATGAA AGGTGGAAAT GAGAATGGCT 4500 


TCATTAAAGT CAATCATCCG TCAAGGTAAA CAAACACGTT CAGATCTTAA ACAATTAAGA 4560 

AAATCTGGTA AAGTACCAGC AGTAGTATAC GGTTACGGTA CTAAAAACGT GTCAGTTAAA 4620 

GTTGATGAAG TAGAATTCAT CAAAGTTATC CGTGAAGTAG GTCGTAACGG TGTTATCGAA 4680 


ttaggcgttg gttctaaaac tatcaaagtt atggttgc 4718 
(2) INFORMATION FOR SEQ ID NO: 300:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3181 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300:

AAAATGCATA TAAATACATA TTAAGGAGGA TTTTATGAAT TTTCTTAAAC CTGCAAAGCA 60

TATTAAGCCT TTGCCAGAAA ATCAGATAGA TGATACCTAT AAACGATTAC GTCTCCAAGT 120

ATTTCTTGGT ATTTTCATCG GTTACGCTGG GTACTATTTA TTACGTAAAA ACTTTTCGTT 180 
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TGCTGTTTCC ATCGCATATG GATTTAGTAA GTTCTTTATG GGTACTGTAA gTGATCGGAG 


300 




CAATGCTCGG ATATTCTTAG TTCTTGGATT AGCACTCACT GCTATCGTCA ATTTGTTAAT 360

GGGATTTGTA CCGTTCTTTA CATCAGGTAT CGGTATTATG TTTGTCCTAT TATTCTTAAA 420

TGGATGGTTT CAAGGTATGG GCTGGCCACC TTCAGGCCGT GTTCTCGTTC ACTGGTTTAG 480

TGTAAGTGAA CGCGGAAGTA AGACTGCCCT TTGGAACGTT GCGCATAATG TTGGTGGAGG 540

TATTATGGCA CCTATTGCTG CTTGGGGTAT TACAACAACT GCATTTATCA ACTTTGGTTA 600

TTTAAAAGGT TTCGAAGGTG TATTCATTTA CCCTGCACTC TTAGCACTTA TCATTGCCGC 660

AATTTCATAC GTATTGATTA GAGACACACC TCAATCTCAA GGTTTACCTC CAATCGAAAT 720

TTATAAAAAT GACTTTGCTA CAAGCGATAA GAAAACATTA GAAACAGAAT TAACTACAAA 780

AGAAATTTTA TTTAAATATG TACTGAACAA TAAATGGGTA TGGGCAATTG CCTTTGCAAA 840

TATATTTGTT TATTTCGTGC GTTATGGTGT ACTTGATTGG GCGCCAGTCT ACTTAAGTGA 900

AGAAAAACAT TTCGACTTAA AAGCATCAGG TTGGGCATAC TTCTTATACG AATGGGCTGG 960

AATTCCTGGT ACATTATTAT GTGGTTACAT TTCTGATAAA TTATTCAAAG GTCGTCGTGG 1020

ACCTGCAGGT TTCTTCTTTA TGTTAGGTGT CACAGTATTT GTATTAATTT ATTGGTTAAA 1080

TCCTCCAGGC AATGCTTGGT TAGACAATGT CTCATTAATT GCCATTGGTT TCTTAATATA 1140

TGGACCAGTT ATGTTAATTG GTTTACAAGC ATTAGATTAT GTACCTAAAA AAGCAGCTGG 1200

CACAgcAGCT GGATTAACAG GATTATTTGG TTATCTGTTT GGTGCTGTAA TGGCCAACAT 1260

CGTCTTAGGT GCTGTAGTTG ATAAATTCGG ATGGGATGTC GGTTTTATTT TATTAACAGC 1320

AATTAGTGTG TTTGCAATGT TGAGCTTTAT CCTCACTTGG AATAAAGTAG GACAAGAAAC 1380

CGTTCATCAT TAAATGATAA AAAATAAAGT CATATGGTTA TCTTATCGAA AGATQATATA 1440

TTCATCTCTT ATAAGTTCAA CCATATGACT TTTTATTAGT ATTCAAAAAA ATATTTACAT 1500

TGCCACTTTT GTGTTTGCCC TGCTGTTTTA TTCAATTGAT TACACCACTT AGGATAAACT 1560

CTAAAAGCCA TTTTCCCTTG ATACTTAGAT GAAGCTAAGA TACCTTTTTC TACCAATAAT 1620

TCCCTAGGAA ATAGAAAGTA GCCATTTAAT TCATCATCGA TAACAGCAAC AATCAGGTAA 1680

TCAGCAAACT CTTCAATTTT GTATGGCCGA TTATAATTAT CtTCGTCTTT TGTCCAACAT 1740

GTCACGAAAT ATCCCGATTT AGTCGGTGTT TTCTTAGCTA ACCTACTTTG ATATGTTTCC 1800

TCTTTAAAGC TAAATGTTAA TGCyTCGTAA TCTTGATTAT ATTTTTCTTC AGTTAAATCT 1860

TTAACTTCTG ATTCTTCACT AAAAATATTT TTCAACAGTA TTTTAGATTT ACACATATGC 1920

CAATCTCACT TTATTTTTTC ATAATCGTAT CATATATTTA TTTTTTTCGA AAAATACACT 1980

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5 



TGTATCAACT 6TCAAACGAT CACGTAATAA ATAGACGATT AACATTGcTG CTAAAGCACA 2100

AAGTGACTCA GCAATTAATA ATTRACT* A A AT AGTCCAAACA AAGCATTCAT 2160

AATAAATAAT ACTGGAATGA Inni luVAt^ GCCATAATTG TAGCACCACG 2220

ACCTTGCCCA GTCGCTTGAA CCTATACCAT TTAATAATAA 2280

TGATGCCATT GTTACTTTCA AAATAAATGt CGCCATCTCA ACAATGGCTT GATCAGTAGT 2340

AAATAGTCCG ACCATATGAT GTCCAATTGT AAATACAGCA CTCATACATA CAACAAAGAT 2400

AACGCCGATA GACATGATAA CTGCTTTGAT AACGTCTTTC ATACGGCCTT TATTTGCCAT 2460

AAAGTTATAT GCAATTAGTG GTACAACACC TTCACATAAT CCCATGATAA TAAGTTCTGG 2520

aAATTCCACA AGTCTAAATG AGATACCATA ACTTGCaATC GCGAAGTTTC CATAATGTGC 2580

TAAAAATAAA TTTAAAACTA ATCCTGTGAA TCCCATTAAG ATACTCATTA AAAATGCAGG 2640

AATACCGATT TTAAAGATTT CAGAAAGCAT TTCTTTATTA GGTTTCGCAA GTTTAATATT 2700

AACTGACACA ACGTCACTAT TTTTCATAAA ATAAATGATA AAGAACAGAG CAGCAGCAAC 2760

ATTACTGATT GCAGTACCCA AAGCTGcACC AACAACGTTT aAATCAAAAC CAAAAATTAA 2820

AATTGGATCT AAAATAATAT TTAAGCCTAC ACTAGCTAAC ATACCAATCA TAGAAACCAT 2880

TGGTGCCCCA ATTGCACGTG CAAATTGTTC TAATATGAAG AACAAAATTA CAAAAGGTGC 2940

ACTTAAAAAC ATTACTTTCA AATAATTACT TGTTAAAGCT AACGTTTCAC CTCTCGCCCC 3000

TAAAATTGCT GCGATTTGAT CACTGAATGG TAAAGTAACT AAAATCACGA TAAGTCCTAG 3060

TGCAATACCA CCATAAATAG AGAAACTACT TACAAATTTA CTCTTAmTAT AGTCTTTCGC 3120

ACCTAATAAA CGTGAAATAT AAGTTCCTGC ACCAACGCCA AATAAATTAC CTAACCCCAT 3180

T 3181

(2) INFORMATION FOR SEQ ID NO: 301:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4029 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301:

TCATCACCTT CAAATAAATA GGCATGCGAT AATTTATTTG AATGATATGC ATTCGTCAAT 60

TGTTGCTGTT CATCCATTCA TTACAAACTC CCTTTTGCTT TTATATAAAA AGGTTGCCAA 120

AGAGCAACTA ATTACTGCAA CAATAGCGAC CATGATTTGT GATTATTTTT AGAACCACAA 180 
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TTATTAATAC ACTATTTTAA GTATCTCTAA TTGATGATTG TATTATAGAA TTAAAATTGA 300 

TGGAATGCAT CAACTGGCAT AACAAATACA GTAGCACCGC CAACTTCAAC TTCAACTGGA 360 

TATGGAATGT ACGAATCCGC ACTGCCTCCC ATAGGTGTAA TAGGTGAAAC CAACTGTTCT 420

CTATTACCAC ACGTTTGATT AATCACAOAC AATATTTCAT CTACAOGGTC ATCATTGACA 480

CCACATAAGA ATGTTGTATT ACCCGCTCTT AAAAACCCAC CTGTTGTTGC CAATTTTGTT 540 


GCTCTAAAGT TATTTTTAAC AAGTTGATCT GCAAGTTCCT GACTATCTTG ATCTTGTACG 600 

ATCGCTATAA TCATTTTCAT TTTATAACAC CTCTTCTAAC AATTATATCA TATCTTTTCT 660 

AAATATTTGA TGATAGTTTG ATACGTGTCT TCAACAACAT TTTCAAGAGG TTQATCTGCA 720 


TTAACGCTTT TGAACCGTTG TGATTCATTA TGAATGATTT CTTGGTAACC TTCAATTACT 780 

TTTTCGTGAA CTTTAAATCT TCTTGATCTA ATCTATTTTG ATCTCTTGAA TTTTTAATAA 840 

TACGTTCGCG ACCTACTTCA GCACTAACAT TTAAATAAAT CGTCAAGTCT GGaTATAATC 900

CATTTATTGC AAATTCGTTT AATGCTCTTA CTTCTTCAAC GCCAATCCCT CTAGCATAAC 960 

CTTGATAAGC TAATGAACTA TCGATATAGC GATCACACAA CACAACCTTA CCTTCTTTTA 1020 

AAGCTGGTAT GACCTTTAAT ACAAGATGTT CTCTTCTAGA TGCAGCAAAT AACATTGcTT 1080

CaGTtCTAAT GTCCATATCA TTGCCcTTCT AATACAATTT TACGTATTTC TTCACCAGTA 1140

GGAACACCAC CTGGTTCTCT AGTCATAATG ACATCATAAT CTTTTACTAA TCTATGGTAA 1200

ACTTCATTAA TTACAGTTGT TTTTCCAGAG CCTTCTGGGC CCTCAAAAGT TATAAAAGCT 1260

GACATTTTAT TCATCCTCAA CTAAAATTTT ATTATTTTTA ATTCCTTCAA CTATCATTCC 1320

AGTTTCCAGA TATTCATTTA CCAATTCTAT CATATTTTCA GTTATTGTTT CTCCTTTGAA 1380 


AATAATAGGA ATcCCTGGcG GATACGGGAC AATATGTGGC GCCAGAACTT TACCTTTgCC 1440 

TTTTcAAATC ACACCAAGTA ACATATTCAA AGCGTTTTGG TTTATAGTTA CCTTCAGTyG 1500 

TTAAAAGTTG TGTTTGTTTA ACTTTAGAAA CTGATTTTTT CGGTAAAATC ATATCTTCAA 1560 


TTTTACGTAA AAGCGAATCA AATAAATACG TATCATCATG ATGCCATAAC GGCAATATCG 1620 

CTAATGCTTG ATAGTCGTCC GCTAATTCTA AATAGATATG TGCATTCATA AACCAATTTT 1680 

GAATATCATG ACCTGTAAAA CCTTCATATT TTATCAGCAA CTTCAACGGA TCATCAACTT 1740

GAAGCATTTC AAAACCCTTC TTCTCCAAAC ATTCGATTAA TTGCGCTCTC TTATCAAAAA 1800 

ACACGGTACT ATCATATGTT TTATAGAACT CGGCAGCTGA CTCTAAACTA GCCATAATCA 1860

AATACGAAGG ACTAGATGTT TGGAAGTAGC TTAGATATTC TATAATAGTT TCTCTATAAG 1920

GTGCATTTTT ATGAATATAA AGTACCGAGC CCATCGTTAA AGCTGGTAAC GTTTTATGAA 1980 
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CAAAGTGCGC GCCGTGTGCT TCGTCAATGA GTACAGGAAT ATTTAATTGG TGCAAAGATT 2100 

TGATAACCTC TTCTACATTA AATGTTTCAC CGTAATAGTT AGGATAAGTC AACACAGCAA 2160 

GTTTGTGACC GTCATTATTC AAACGGCTTA AATTAACTTT ATTATAATGA TTCGTTAACG 2220

GACTTTGATG CGTTTCAATA AAATGCCCTT CTTGTTGGCT AATATCGAGC GCATGTAACA 2280 

CAGATTTATG TACATTTCTT GCCATTAAGA TATCGCCTTT TTTCTGTGAA AAAGACTGGA 2340 


TGACAGATAA TATTCCTGAA GTGGTGCCAT TCACTAAGAA ATAAGCATCA TAATCTGAAT 2400 

GTTTCTCCAC CTGCTTCATA CTTTCCAAAA TGACTTCTTC AGGATGATGT AAATCAnCTn 2460 

AATCCAGGTA TTTCAGTTTT ATCCATTGTC ATTGATAATT GAGATAAATG ACCGATAGTC 2520

ATATTTTTAT GACCCGGAAC ATGCAAAGAA ATCGCTTCTT CTTGATTTAA ACTTTCTAAT 2580 

TTATTTAAAA TAGGTTGCTT CATGATATAC GCTTCCTTTA TTTACACTGT TTTGGAATTA 2640 

GTTACTTTCA AAAGTATTAA TTATATAGTA ACACTTCTTT GACAAAAGTT AGTGTTACTT 2700

ATGCAATAGC TTGTCTATTG TATAATAATT AATTTCCTTT TTGTACTTCG ATTTAAAAGA 2760 

TATTAGACAT AAAATCTAAA AACAGCAGTA AGATGATTTA TGATTAAAAA CTATCTTACT 2820 

GCTGTTCACT TTTTATAATA CTTCTGAATG TCTTCACTTA TACTTCTAGT CACAGATTTA 2880

AATAATCAAA AGTGCACATT ATTAAAATAT CAATTTCACA CTCAATGCGG CTCATCGCAT 2940 

TCATTTCTTG TCTAGCAACG TTCTACTCTA GCGGAACGTA AGTTAGCTAC CATCCTCX5CT 3000

AAGAACCTTT CTTGACTTGT GACAATCGCT TGCTTCTTTC CTCTCCTTCG GCTCTCGCTT 3060 

ACTCATTTAG CTCTACTAAA CTCGTTGCGC TCTTTTCTCG TTTCGTCAGA TTCAAACGTT 3120 

TTCACTTCGC CAAGCCATTT TTCTTTGTGT TTACTTTTTA TTTTGACGTT TTAGACATAA 3180 


AAAAAGAGAC cTCACGGTCT CAACTTGCCT GGCAACGTTC TACTCTAGCG GAACGTAAGT 3240 

TGGCTACCAT CGTCGCTAAA GACCTTTCTT GACTTGTGAC AATCGCTTGC TTCTTTCCTC 3300 

TCCTTCGGCT CTCGCTTACT CATTTAGCTC TACTAAACTC GTTGCGCTCT TTTCTCGTTT 3360

CGTCAGATTC AAACGTTTTC ACTTCGCCAA GCCATTTTTC TTTGTGTTTA CTTTTTATTT 3420 

TGACGTTTTA GrCATAAAAA AAAGAGACcT TGCGGTCTCA ATGCGGCTCA TCGCATCCAT 3480 

TTTTTGCCTG GCAACGTTCT ACTCTAGCGG AACGTAAGTT GGCTACCATC GwCGCTAAgA 3540

aCCTTTCTTG ACTTGTGACA ATCGCTTGCT TCTTTCCTCT yCTTCGGCTC TCGCTTACTC 3600 

ATTTAGCTCT ACTAAACTCG TTGCGCTCTT TTCTCGTTTC GTCAGATTCA AACGTTTTCA 3660 

CTTOGCCAAG CCATTTTTCT TTGTGTTTGC TTTTTATTTT GACGTTTTAG ACATAAAAAA 3720

AAGAGACCTT GCGGTCTCAA TGCGGCTCAT CGCATCCATT TTTTGCCTGG CAACGTTCTA 3780 
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CATGGGAACA 


GGTGTGACCT CCTTGCTATA GTCACCAGAC ATATGAATGT AATTTATACA 3900

TTCAAAACTA GATAGTAAGT gAAAAGTGGA TTTTGCTTCQ CAAAACATTT ATTTTGGATT 3960

AAGTCTTCGA TCGATTAGTG ATTCGTGCAG CTCCACATGT GCACCATGCT TGCCACCTCG 4020

GAACCTATT 4029


(2) INFORMATION FOR SEQ ID NO: 302: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 










(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302: 






GATGCAATAG TTGAGTAGTT ATAAGTAGCT ACATACAACC GCTCAAATAT AGGTTCAAGA 60

ACATTTTTAA TGAAGAAAGC GATTTTCCCm CAGAGTGCAA ACGCTTGACT CAATAACGAA 120

ATGTCACCTA TGTATGGCGT GACTTATTAT ACAGGaGGTG CAAAGTATGT TTGCTATTAT 180

TGAAACAGGT GGAAAACAAA TCAAAGTAGA AGAAGGTCAA TTGAAAAATT 240

AGACGTAAAC GAAGGAGATA CTTTTACATT TGATAAAGTA TTATTTGTAG GTGGAGATTC 300

AGTTAAAGTT GGAGCGCCAA CAGTTGAAGG TGCAACAGTT ACTGCTACTG TTAATAAACA 360

AGGtCGCGGT AAAAAAATCA CTGTATTCAC ATACAAACGT CGTAAAAATT CAAAACGTAA 420

AAAAGGCCAT CGTCAACCAT ACACTAAATT AACAATCGAT AAAATCAACG CGTAATTATT 480

ATGATTACTG TTGATATTAC AGTTAATGAT GAAGGCAAAG TAACAGACGT TATTATGGAT 540

GGCCATGCTG aCCATGGTGa ATATGGTCAT GATATCGTTT GTGCTGGAGC TTCAgCTGTA 600

TTGCTTGGTA GTGTTAATGC GATTATAGGA TTGACATCTG AGAGACCAGA TATCAATTAT 660

GACGACAATG GTGGTCATTT TCATATAAGA AGCGTTGATA CAAACAACGA TGAAGCGCAA 720

CTAATTCTTC AAACAATGCT TGTGTCTTTA CAAACTATTG AAGAAGAATA TAATGAGAAT 780

ATTAGATTAA ATTATAAGTG AGGTGCATTC CGATGTTAAA ATTAAACTTA 840

CATCTAAAAA AGGGGTAAGT TCTACAAAAA ACGGACGTGA CTCTGAATCA AAACGCTTAG 900

GTGCTAAACG TGCTGACGGT CAATTCGTAA CAGGTGGTTC AATTTTATAT CGCCAACGTG 960

GTACTAAAAT TTACCCTGGT GAAAATGTAG GTCGTGGTGG CGATGATACA TTATTCGCTA 1020

AAATCGACGG CGTTGTTAAA TTCGAACGTA AAGGTCGCGA CAAAAAACAA GTTTCTGTAT 1080

ATGCAGTAGC TGAATAATTT TGTCTAGTTA ACACCAGAAG TGAATCTTCT GGTGTTtTTT 1140

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AGACGTTATA 


CTAAATGTGC AcTGTATAAG AGCCCCTAAT CACTAAACTA TAAGGGGGAC 1260

AAAGGAATAC AGTTGCAGCG TTTAAAGAAT AAACTGTACC ACAATTGGTG CTGAGAAATA 1320

TAAGTATTTT AAAGCAAAGA TTTATAAAAG TAACTGCATA AOAGCCCCTA ATTATTTACA 1380

ATATATAAGG GGCTCTAATA TGCTATAATT ATTGGGAAAA TGAAAATTAT ATGTAAAAGA 1440

GGTGAGATAT ATGTTTGTCG ATCAAGTCAA AATATCTCTT AAAGCCGGTG ATGGTGGTAA 1500

TGGTATTACC GCATACAGAA GAGAAAAATA TGTACCATTT GGTGGACCAG CTGGCGGTGA 1560

CGGTGGTAAA GGTGCTTCAG TCGTATTTGA AGTGGATGAA GGTTTAAGAA CGTTATTAGA 1620

TTTTAGATAT CAACGTCATT TTAAAGCAAG CAAAGGTGAA AATGGCCAAA GTAGTAATAT 1680

GCATGGTAAA AATGCGGAAG ATTTAOTATT AAAAGTTCCA CCTGGTACAA TTATTAAAAA 1740

TGTTGAAACA GACGAAgTGT TAGCAGATCT TGTTGAAGAT GGTCAAAGAG CTGTAGTAGC 1800

GAAGGGCGGT CGAGGTGGCC GAGGTAATTC ACGTTTTGCA ACACCTAGAA ACCCTGCACC 1860

TGACTTCAGT GAAAAAGGTG AACCAGGTGA GGAATTAGAT GTATCTTTAG AATTGAAATT 1920

ATTAGCTGAT GTAGGATTAG TAGGTTTCCC TAGTGTGGGT AAATCGACTT TATTATCTAT 1980

CGTTTCAAAA GCTAAGCCTA AAATTGGGGC ATATCATTTT ACAACGATTA AACCAAATCT 2040

AGGTGTTGTT TCAACGCCTG ATCAACGTAG TTTTGTTATG GCAGATTTAC CAGGTTTAAT 2100

TGAAGGTGCA TCTGATGGCG TTGGATTAGG ACATCAATTT TTAAGACATG TAGAGAGAAC 2160

AAAAGTTATT GTTCACATGA TTGATATGAG CGGTTCTGAA GGTAGAGAAC CTATTGAAGA 2220

TTATAAAGTC ATTAATCAAG AATTAGCTGC GTACGAGCAA CGTTTAGAAG ATAGACCTCA 2280

AATOGTAGTA GCTAACAAGA TGGATTTACC TGAATCACAA GATAATTTAA ACTTGTTTAA 2340

AGAAGAAATT GGOGAAGATG TGCCAGTTAT TCCAGTTTCA ACAATAACGC GTGATAATAT 2400

TGATCAATTA TTATATGCAA TAGCAGATAA ATTAGAAGAA TATAAAGATG TTGACTTCAC 2460

AGTTGAAGAA GAGGAGTCAG TTGGCATTAA CCGAGTATTA TATAAACATA CACCGTCACA 2520

AGATAAATTT ACAATTTCAA GAGATGATGA TGGTGCTTAT GTGGTAAGTG GTAATGCTAT 2580

TGAAAGAATG TTTAAAATGA CTGACTTTAA CAGTGATCCA GCAGTACGTC GATTTGCTCG 2640

TCAAATGCGT TCGATGGGTA TTGATGATGC GCTTAGAGAA CGTGGTTGTA AAAATGGTGA 2700

TATCGTTAGA ATTCTTGGCG GAGAATTTGA ATTCGTTGAA TAGGAGCGAA ACATGATGGA 2760

CAATAAAGAT TATAAAAAGT TTTATTTAAT TAGAGAAGAT GTCTTGCCTG AATCCGTGGT 2820

TAAAACATTG AAGATTAAAG ATGCCTTAAA AAGTGATCCG ACATTGTCCA TTTATGATGC 2880

CGTTAAACAG TTTGATCTAT CTAGAAGTGC TTTTTATAAA TATAGAGAAA CGATATTTCC 2940

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TGTTGGTATG TTGGCACGTG TACTAGATGT TATATCAAAG TTAGAACTAT CTGTATTAAC 3060 

GATTCATCAA AGTATTCCAA TGGAAGAAAA AGCAACAATA ACATTATCAC TGAATGCTAA 3120 

ATCTAAAGAA ACTTCAGTAG AAGATGTTAT TGGCGCTTTG AGAAATTTAG ATTATGTATC 3180 

AAAAGTAGAA TTAATTAGTA TGAGTATGTA AGGATGTGCC TATAATGTAC GCGTATGTCA 3240 

AAGGTAAGTT AACACATTTA TATCCTACAC ACGTAGTTGT TGAAACTGCT GGTGTTGGTT 3300 

ATGAAATTCA AACACCAAAT TCTTATCGTT TTCAAAAGCA TCTAGATCAT GAAGTTTTAA 3360 

TTCATACATC TTTAATTGTT CGTGAAGATG CACAATTATT GTATGGATTT AGTAGTGAAG 3420 

AAGAGAAAGA TATGTTCTTG AGTTTAATTA AAGTTACTGG TATTGGTCCG AAATCAGCTT 3480

TAGCTATTTT AGCGACAAGT ACGCCTAATG AAGTAAAACG TGCCATTGAA AATGAAAATG 3540 

ATAOGTATTT AACTAAATTC CCAGGAATTG GTAAGAAAAC GGCAAGACAG ATTGTCTTAG 3600 

ATTTAAAAGG TAAAGTGAAA ATTACTGAAG AAGATAGCGA TTCATTATTA CAAGTAGACG 3660

CTACTTCGAC GGTGCAAGAT CAATTCGTGC AAGAAGCAAT GTTAGCGTTA GAAGCATTAG 3720 

GTTATTCTAA ACGAGAGCTT GCAAAAGTTG AGAAAACGTT AAATAAAAAT AAATATGACT 3780 

CAGTTGATGA AGCTGTTAAG GCAGGTCTTC AATTAGTTGT ATCTTAATTT TAAATAGATT 3840

AATAGGGGAA GTGTTGTCAT GAATGAGCGT ATGGTTGATC AATCAATGCA TAGTGAAGAA 3900 

ACTGATTTCG AATTGTCGCT TAGACCTACG AGATTACGAC AATATATTGG TCAAAATTCA 3960 

ATAAAAAGTA ATTTAGAAGT ATTTATTAAA GCGGCTAAAC TTCGTCATGA ACCATTAGAT 4020 

CATGTATTGC TTTTTGGCCC CCCTGGATTA GGTAAGACAA CATTATCTAA TATCATTGCC 4080

AATGAAATGG AAGTTAATAT ACGTACAGTA TCAGGGCCTT CATTAGAAAG ACCTGGTGAT 4140 

TTGGCTGCAA TTTTATCAGG ACTTCAACCT GGAGATGTTT TGTTTATTGA TGAAATACAC 4200 

AGACTGAGTA GTGTTGTTGA AGAAGTGTTA TACCCTGCAA TGGAAGATTT CTTTTTAGAT 4260

ATTATCATTG GTAAAGGCGA TGAGGCTAGA AGTATCCGTA TCGACTTACC TCCATTCACT 4320

TTGGTAGGTG CAACAACGCG AGCTGGCAGC TTAACAGGTC CACTAAGGGA TCGATTTGGT 4380

GTGCACTTAA GATTAGAATA TTATAACGAA TCAGATTTAA AAGAAATCAT TATTAGAACA 4440 

GCTGAGGTTT TAGGCACAGG TATTGATGAA GAAAGTGCCA TTGAACTTGC TAAACGTTCT 4500

AGAGGGACTC CAAGAGTAGC AAATCGACTA TTGAAGCGGG TAAGAGACTT CCAGCAAGTG 4560 

AATGAAGATG AACAAATATA CATTGAAACA ACGAAGCACG CATTAGGTTT ACTTCAAGTT 4620 

GATCAACACG GACTAGATTA CATTGATCAT AAAATGATGA ACTGTATTAT TAAGCAGTAT 4680

AATGGCGGAC CTGTTGGTTT AGATACGATT GCCGTAACAA TTGGTGAAGA ACGTATTACA 4740 
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5 



GGCAGAAAAG CAACACCATT AGCTTATGAA CATTTTGCAA AGTCGAATGA GGAGAGAGAA 4860

TAACGTGAAT ATTGAAGAAT TTGACTATGA CTTACCAGAA TCATTAATTG CTCAAACGCC 4920

TTTAAAAGAT CGTGATCATA GTCGTTTATT AGTCATGGAT AGAGAAACTG GTGAAATGAA 4980

ACATTTACAT TTCAAAGATA TCATTGAGTA TTTTAGACCT GGTGATACAT TAGTGCTTAA 5040

CGATACGCGA GTAATGCCAG CTAGACTTTT TGGTTTAAAA GAAGAAACTG GTGCAAAAGT 5100

TGAAATGTTA ATGTtAACTC AAATTGAAGG TAATGATTGG GAAGTCTTAC TGAAACCAGC 5160

TAAGCGTATT AAAGTTGGTA ATAAATTGAA TTTTGGTAAT GGCAAAATTA TAGCTGAATG 5220

CATAAAAGAA ATGGATCAAG GTGGACGCAT CATGCGTTTA CATTATGAAG GTATTTTACA 5280

AOAAAGATTA GATGAATTAG GGGAAATGCC ACTGCCACCA TACATCAAAG AACGTTTAGA 5340

TGATCCAGAT CGTTATCAAA CAGTTTACGC TAAAGAAAGT GGTTCAGCGG CAGCACCAAC 5400

AGCAGGATTA CATTTTACTG ATGAGTTATT AATTGAAATT AAAAATAAAG GTOTTAATAT 5460

CGCATTTGTT ACATTACATG TTGGGTTAGG TACGTTTAGA CCGGTGAGCG TAGACGATGT 5520

GAATGACCAC GAAATGCATA GTGAATATTA TCAAATGACm CAAGAAACAG CTGATTTATT 5580

AAATGATACT AAGCCAAAGG ACATCGCATT ATATCAGTTG GTACAACTTC AACACGTACA 5640

CTTGAAACAA TTCGACGCGA TCATGATAAA TTTGTTGAAA CGAGTGGCTG GACTAATATA 5700

TTTATTTATC CAGGATTTGA TTTTAAAGCA ATTGATGGCC AGATTACTAA TTTTCATTTA 5760

CCAAAATCAA CATTAGTTAT GCTAGTATCA GCGTTTAGTA GTCGTGAAAA TGTTCTGAAT 5820

GCTTATAAAA CGGCAGTAAA TTTAGAATAT AGATTCTTTA GTTTTGGCGA TGCAATGTTA 5880

ATTATATAAA AAGAATGTGA GGATTTTGAA TATGCCTGCA GTAACATACG AACACATTAA 5940

AACTTGTAAA CAATCAGGTG CGCGTTTAGG TATCGTGCAC ACACCACACG GTTCATTTGA 6000

AACACCTATG TTTATGCCAG TTGGTACTAA AGCAACCGTT AAAACAATGA GTCCAGAAGA 6060

GTTAAGACAA ATTGAAGCAA AAATCATTTT GGGCAACACA TATCATTTGT GGTTACAACC 6120

CGGAAATGAT ATTATCAAAC ACGCTGGGGG ATTACATAAA TTCATGAATT GGGATGGTCC 6180

GATTCTTACA GATTCAGGCG GTTTCCAAGT GTTTAGTTTA AGTAATTTAC GTAAAATTAC 6240

AGAAGAAGGC GTGGAATTTA GACATCATAC TAATGGGTCT AAATTATTTT TGAGTCCTGA 6300

GAAATCAATG CAAATTCAAA ATGATTTAGG ATCTGATATT ATGATGGCAT TTGATGAATG 6360

TCCACCGATG CCTGCTGAAT ATGATTATGT AAAAAAATCT ATTGAACGTA CAACACGTTG 6420

GGCGAAAAGA TGTCTAGATG CACACCAAAG ACCTGAAGAT CAAGCATTGT TCGGCATTAT 6480

ACAAGGTGGC GAATATGAAG ATTTAAGAGA ACAAAGTGCA AAGGATTTAG TAGAATTAGA 6540
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AATGGTTGAA CATACAGAGC AGTTTATGCC TAAAGATAAA CCAAGATATT TAATGGGTGT 6660 

AGGATCTCCa GATGCGTTAA TCGAATGTAG TATTCGCGGC ATGGATATGT TTGATTGTGT 6720 

CTTACCGACA CGTATTGCCA GAAATGGTAC TTGTATGACA TCGCAAGGTC GTTTAGTTAT 6780

TAAAAATGCA AAATTTGCAG ATGATTTAAG ACCGTTAGAT GAGAATTGTG ACTGTTATAC 6840 

ATGTCAAAAC TATTCAAGAG CGTATATACG TCATTTAATC AAGGCAGAGG AAACTTTTGG 6900 

TATTCGTCTT ACTACTATTC ATAATTTACA TTTTCTGCTA AAATTAATGG AAGATATAAG 6960 

ACAAGCCATT CGAGAAGATC GTCTTTTAGA TTTCAAAGAA GAATTCTTCG AGCAATATGG 7020 

ATTAAATGTT GAGAACCCAA AAAACTTTTA AGCAAGAGGA GCGTATAAAA TGCAATTTTC 7080

ATTACTAATA TATATAGTCG TAATTTTTGC GGTTATGTAT TTCTTGATGA TCAGACCACA 7140 

ACAAAAACTG TGCGAAACA 7159 
(2) INFORMATION FOR SEQ ID NO: 303:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3159 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303:

TCCATTTATA GAAATTTCTA AAGCAGAnAA TAAGATAGAA GATATCGGCC AAGGTGCAGA 60 

AATCATCAAA AGAACACAAG ACATTACTAG CAAACGATTA GCTATAACTC AAAACATTCA 120 

ATTTGATTTT GTAAAAGATA AAAAATATAA CAAAGATGCC CTAGTTGTTA AGATGCAAGG 180

CTTCATTAGC TCTAGAaCAA CATATTCAGA CTTAAAAAAA TATCCATATA TTAAAAGAAT 240 

GATATGGCCA TTTCAATATA ATATCAGTTT GArAACGAAA GACTCTAATG TTGATTTAAT 300

TAATTATCTT CCTAAAAATA rAATTGATTC AGCAGATGTT AGTCAGAAAT TAGGCTATAA 360

TATCGGCGGA AACTTCCAAT CAGCGCCATC AATCGGAGGC AGTGGCTCAT TCAACTACTC 420 

TAAAACAATT AGTTATAATC AAAAAAACTA TGTTACTGAA GTAGAAAGTC AGAACTCTAA 480

AGGTGTTAAA TGGGGAGTGA AAGCAAATTC ATTCGTTACA CCGAATGGTC AAGTATCTGC 540 

ATATGATCAA TACTTATTTG CACAAGACCC AACTGGTCCA GCAGCACGAG ACTATTTCGT 600 

CCCAGATAAT CAACTACCTC CTTTAATTCA AAGTGGCTTT AATCCATCAT TTATTACAAC 660 

ATTGTCACAC GAAAGAGGTA AAGGTGATAA AAGCGAGTTT GAAATcACTT ACGGCAGAAA 720 

CATGGATGCT ACATATGCTT ACGTGACAAG ACATCGTTTA GCCGTTGATA GAAAACATGA 780 
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AAAAATTAAA AGCATCACAC CTAAGTAAAC AGTTCaATCA TCTTAAAAAA TCCTGGGACA 900 

CTTCATACTT GTCTCAGGAT TTTTTmACAA ATTGAATCAG CCTCATAACA TTAAATTATT 960 

TTATCGTACA TTAAATTTAA TAATAACAAC TGATTTTTAT AAGAATAAAG TATCGAmCCA 1020 

TAGTAGATAC ACAAATAATA CAAATGAAAC AATTTAACTT GAAAGCTTAa ATAAATATTA 1080 

TCAAGTTAAT AAACAATTAA TTTTTAGATG GATTCATCAA AAATCGTAAA AAAGCACAAT 1140 

TTGTATTTTA CAAACATTAA TTAAAAAAGA AAGCAAGACA TTCGTGCAAT CGGTTACCTT 1200 

AAATTGTTTA CAACTGTCAA CAATACCAAG GTTTTATTAA CTATATTTCT CACAAAATTA 1260 

GCTTTTAGCA TTCCAAACAA AAAAGGTTAA ATCGAACGGA ATTATGGCAT TTTTAACTTA 1320 

ATTGTAAAAA AAGTTGATAA TGGTCAATTG TTAATGAACA GTTAATTATA ATAACGCCCA 1380 

AAATATATTA TTATTTAATT AAGTTAAATA AAATTATAGA AAGAAAGTGA AACTTATGCT 1440 

TAAAAATAAA ATATTAACTA CAACTTTATC TGTGAGCTTA CTTGCCCCTC TTGCCAATCC 1500

GTTATTAGAA AATGctAAAG CTGCTAACGA TACTGAAGAC ATCGGTAAAG GAAGCGATAT 1560 

AGAAATTATC AAAAGGACAG AAGATAAAAC AAGTAATAAA TGGGGCGTGA CTCAAAATAT 1620 

TCAATTTGAT TTTGTAAAGG ATAAAAAATA TAACAAAGAT GCTTTGATAT TAAAGATGCA 1680 

AGGATTCATT AGCTCTAGAA CAACATATTA CAACTATAAA AAAACTAATC ATGTTAAAGC 1740 

TATGCGATGG CCATTCCAAT aTaATATTGG TTTAAAAACA AATGATAAAT ATGTTTCTTT 1800 

AATTAATTAT TTACCTAAAA ATAAAATTGA ATCTACAAAC GTGAGTCAGA CATTAGGATA 1860 

CAATATCGGT GGTAATTTCC AATCAGCCCC ATCACTCGGT GGTAATGGAT CATTTAACTA 1920 

TTCTAAATCG ATTAGCTATA CACAACAAAA TTATGTAAGT GAAGTAGAAC AACAAAACTC 1980 

AAAAAGTGTT TTATGGGGCG TCAAAGCGAA TTCATTCGCC ACTGAATCAG GTCAAAAATC 2040 

AGCCTTTGAT AGCGATTTAT TTGTAGGCTA CAAACCTCAT AGTAAAGATC CTAGAGATTA 2100 

TTTCGTTCCA GACAGTGAGT TACCACCTCT TGTACAAAGT GGATTTAACC CTTCATTTAT 2160 

CGCCACAGTA TCTCATGAAA AAGGTTCAAG CGATACAACC GAATTTGAAA TTACTTACGG 2220

AAGAAACATG GATGTCACTC ATGCCATTAA AAGATCAACG CATTATGGCA ACAGTTATTT 2280 

AGACGGACAT AGAGTCCATA ATGCATTTGT AAATAGAAAC TATACTGTGA AATACGAGGT 2340 

CAATTGGAAG ACTCATGAAA TCAAGGTGAA AGGACAGAAT TGATATGAAA ATGAATAAAT 2400 

TAGTCAAATC ATCCGTTGCT ACATCTATGG CATTATTATT ACTTTCTGGT ACTGCTAATG 2460 

CTGAAGGTAA AATAACACCA GTCAGCGTAA AAAAAGTCGA TGACAAAGTT ACTTTATACA 2520 

AAACAACAGC CACAGCAGAT TCTGATAAAT TTAAAATTTC ACAGATTTTA ACATTTAATT 2580 




EP0 786 519 A2 






ACTCAGGCTT TGTGriAACCT AATCCTAATG ACTATGACTT TTCAAAATTA TATTGGGGAG 2700 

CTAAATACAA TGTATCTATA AGCTCACAAT CTAATGATTC AGTAAACGTC GTTGATTATG 2760 

CACCAAAAAA TCAAAATGAA GAGTTTCAAG TTCAAAATAC TTTAGGCTAT ACATTTOGTG 2820 

GTGACATTAG TATCTCTAAT GGTTTATCTG GTGGACTTAA TGGAAATACA GCTTTTTCTG 2880

AAACAATTAA TTATAAACAA GAAAGTTACA GAACAACATT AAGTCGCAAC ACAAATTATA 2940 

AAAATGTTGG CTGGGGAGTT GAAGCACATA AAATTATGAA TAATGGTTGG GGACCTTATG 3000 

GAAGAGATAG CTTCCACCCA ACATATGOTA ATGAACTCTT CTTAGCTGGC AGACAAAGCA 3060 

GTGCATACGC TGGCCAAAAC TTCATAGCGC AACACCAAAT GCCATTATTA TCTAGAAGTA 3120 

ACTTCAATCC AGAATTTTTA AGCGTACTAT CACACAGAC 3159
(2) INFORMATION FOR SEQ ID NO: 304:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3821 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304:

GCAAAAACTT TTCTCCAACT ATGTCAGACT ATAATGGCAT GAACAAAATT GATATGATGA 60 

ATCAAATTAA AGTTGATACG ATGTTGCATG GTTATCACGC TGGATTTTTA TTTGCATTAT 120 

TGATTACAGT TGTTAGTTTC TTTTGTTCAT TTATGTTACA AGGTAAGAAA AAAGAAGTTG 180 

ATTCTCX3TCA ATAAATATAA GTTGCTAGCT ATATAAAGCT TTTTAGCAAA AGTTCAACAT 240

ATTGACTTAT CCGGCATTTC AGATTAAAAT ATTTTTATTC CGATTAGAAT AATAAGAATA 300 

AGGAGATATA TTCTATGAAA AGACTTTTAT TTGTGATGAT AGCTTTCGTT TTCATATTGG 360 

CTGCATGCGG AAACAATTCG TCGAAAGACA AGGAAGCTAG TAAAGATAGC AAGACAATTA 420

ATGTTGGGAC TGAGGGGACT TATGCACCAT TTAGTTTCCA CGATAAAGAT GGTAAATTAA 480 

CTGGTTACGA TATTGATGTT ATTAAAGCAG TGGCTAAAGA AGAAGGTTTA AAACTTAAAT 540 

TTAATGAAAC TTCTTGGGAT TCTATGTTTG CAGGTTTAGA CGCAGGGCGT TTTGATGTAA 600 

TCGCGAACCA AGTAGGTATT AATCCTGATA GAGAAAAGAA ATATAAATTT TCTAAGCCTT 660 

ACACATTCTC AAGTGCTGTT TTAGTTATTC GTGAAAATGA AAAAGATATT AAAGATTTTG 720 

ATGATGTTAA AGGTAAGAAG TTAGCACAAA CATTCACATC TAATTATGGT AAATTAGCTA 780 

AGGATAAAGG TGCTGATATT ACAAAAGTTG ATGGCTTTAA CCAATCAATG GATTTATTAT 840 
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AAAAACCTAA TGcTAAGATC AAAGCAATCA 
AAGGTAATGC TGAACAAAGT AGATCTGCAT 960

TTGCATTTTC TAAAAAAGCA GATGATGAAA CAGTTCAAAA ATTCAATGAT GGCTTGAAAA 1020

AAATCGAGGA AAACGGTGAA TTAGCTAAAA TAGGTAAGAA ATGGTTTGGT CAAGATGTTT 1080

CTAAATCTAA ATAGCGAACA ACAACATGCG CTAGATGCTG CAAAACAAGC TTTCGGACCT 1140

ATGCTAGAAG GTTTGGTCAA ATATTCAATT CCTATTACAT TAGTTACATT TGTTTTAGGA 1200

TTGATTATTG CATTATTTAC AGCATTAATG CGAATTTCAA CGAGTAAAAT TTTAAGAAGT 1260

ATTTCACGTG TCTATGTATC TATTATTCGA GGAACACCAA TGATAGTACA ACTATTTATC 1320

ATATTTTATG GTATTCCAGA ATTAGGTAGA TTATTAACAA ATGACGCTGA CAACCAATGG 1380

ACATTGGCAC CTGTAGTGGC TGCTATTATT GGTTTATCAT TAAATGTAGG TGCGTATGCT 1440

TCGGAAATTA TTCGTGGCGG TATTATTTCT ATACCGAAAG GACAAACAGA AGCTGCaTAT 1500

TcCaTCGGTA TGACGTATGG TCAAACGATA CAACGTATCA TTTTACCGCA GGCAATTCGA 1560

GTGTCGATTC CTGCACTAGG TAATACATTT TTAAGTTTAA TCAAAGATAC ATCATTATTA 1620

GGATTTATTT TAGTGGCTGA AATGTTTAGA AAAGCTCAAG AAGTTGCGTC TACAACATAT 1680

GAATATTTAA CAATTTATGT GTTAGTTGCG CTAATGTACT GGGTGGTATG CTTTATTATT 1740

TCAATTATCC AAGGTATCTA TGAATCTTAT ATTGAAAGAG GGTATCGCTC ATGATTCAAT 1800

TGAACAATAT CCATAAATCA TTTAATGATG TTGAAGTCAT CAAAGGTATT GATTTATCTG 1860

TTGAACAAGG TGAGGTTGTA ACCTTAATCG GTCGATCTGG TTCAGGTAAA ACAACATTGT 1920

TACGTATGAT TAATGCATTA GAAATTCCAA CTGAAGGTAC AGTTTATGTT AACGGCAAAA 1980

CATATACATC TAAAGATAAA AAATCACAAA TAGAAGTTCG TAAACAGTCT GGTATGGTAT 2040

TTCAAAGTTA TAACCTTTTT CCGCATAAGA CGGCATTAGA AAATGTAATG GAAGGTCTTA 2100

TCACAGTTAA AAAGTTGAAA AAGGATGAGG CACGTGGGAA ATCACTTGAG TTACTTGAGA 2160

AAGTTGGTTT AACACATGTC AAAGATCAAC GTCCACATGC ATTATCAGGT GGTCAACAAC 2220

AACGTGTTGC TATTGTCAAG AGCACTAGCA ATGAACCCTA AAGTGATGTT GTTTGATGAA 2280

CCAACATCTG CACTTGATCC TGAACTTGTG AATGATGTTT TAAAGGTTAT TAAAGATTTG 2340

GCTAATGAAG GCATGACAAT GGTCATTGTG ACACATGAAA TGCGTTTTGC TAAAGAAGTA 2400

TCTAATAACA TTGTATTTAT TCmTGAAGGC ATGATCGGAG AACAAGGGGC TCCAGAAGAG 2460

ATGTTCAATC GTCCGAAAAC AGAAGAATTA AGACGTTTCT TAAATGTTAT AAATGAAGAA 2520

TAATCAAATA GAACCACGTA TCATGTTTTA GTATGGCGAT GAAGCCATAT ACATGATGCG 2580

TGGTTCTTTG TTATGTTGTC ATAATCTTGG AGCGATATTT TAACGACGTT TATGATTTAA 2640
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10 



TTCTACATGT GCGTTAAAAC crrrrrTGAA TTGTTGGACG CCATAGTCTT CTGATGACTC 2760

TGAAAAGTCA CCGGTAATAC CATAAAAATT ATAGCGATCA ATATGATGCG CTTTAGCAAA 2820

CTTAATCATT TCCCaCTGCA AATGGTAGGC ACCCATATAA GCATTATATT TAGGGTTTGA 2880

ACCACTAOAT AAGTAATAAA CTTCATGCTC ATTGTAGATA AATAAAGCAG AAGCTAAGTT 2940

TAAGACTGCA CCATCTTGTT CAATTTGTTC TATTGTATTG TCGATTTTAC GCTTATTGCT 3000

ATTTAGCTGT TGTTCTAGCT GTGTGCGTTT CGTTTTATTT TTCTTTGAAT TAGGACTTTC 3060

TTCCAATGCT TCTTCAACAC CTGAGAGTTC AGCTGTTAAT TGTTGTTGCT TTAATTGTAA 3120

CGTTTTTAAA TACTCGTTTA AATCAATATA CGCCAACTTT AACATGGCGT GGTCATCGTA 3180

TAACTTTTGC ATTTCTTCAA AGTATGGTAA CTCACGGAAT TTGAAACCGT GCTTTTCCTC 3240

AGCCATATGG AATAAGTCGA AAAAAGTTTG CGTTTCATCA ATCGTTAAcG TTTTaGTTTT 3300

GACACCAAtA TCaTATGTtT tTtTAATATT ACGTCTCGtT TGATAATCCA TTTCTTTTAA 3360

AAGTTGGTCT TCAGTCTTAT CTTTTAAATC TAACACTGAC AGCCAACGGA TTTGGCTCAT 3420

TGAATCATAA CCTACAGGGA AACCTTGGTG TTTATAACCT AATTTATCCA TTGTTCTaAC 3480

AAATGCTCGG TTATCATAAG ATTTAACAAT TTCACCGTCT
yi t^n^M^M^n^^ ft ft
AHl L L L Lnnl

TAAATATGGA TCTACAAGGA CATATAAACA ATTGTGTTTC TTTAAATATG ACGTTAATGC 3600

TTTAAAGAAA AATGCTACTA ATGATTGATT TCTATAATCC ATCACTGGCC CGCGATGTGT 3660

ATAAAAATAT TTGAAAAATT TAAGTGTGCG TGCTTCTGTC AATAAGCATC CTGCAATCAC 3720

TTGACCATTG TCATCTTTAA CCCCTACrAG ATGCACATCG CCTTTTAAAT CAACTCTATG 3780

ATTGTAATGA ATAGCTGATT GTGTGTAATG TGAAAAATGC T 3821



(2) INFORMATION FOR SEQ ID NO: 305:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1422 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305:
GCGACACCAT TGTTTGCAGC AACTGTTAAT TTACCAGTTG ATGGTTTCGA CGGTGTTGTT 60
GGTTTTGATG GTGTAGTAGG GGTAGTTGTA GATTGCGTAC CCCATGGCGC CACTTTACCC 120

ATTTTTATTA AATATTTTTC ATTAATTAAG TCATATAATT GATCATAACT ATAATTATGA 180 
CTTCTTAAAT ATCCATGTGG ATcGGCATGG TCAGTACCAC CTAAATATTT ACTTACAGCG 240 
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5 



TATTGTAATT GTGTAGCTGC ATAGTCAGCA TAGTTATTCA TTGAACGTGC AAATGAAGCA 360

TAGTCGTGTG TGTGTACGAT TTCAACATTG ATGAATCTAG GGTTACCGAC TGCACCGACA 420

CCCCAAGATA AGTAATCCGT TGGTGCTGTT TOGATTATAC GATCCCCATC AACAAATGCA 480

TGTACGAATG CGTTTTGATA GTTATTTTTC ATATAACTAA TTTCACCATT TATCGTCGAA 540

CGATCATTAG CTGTATCATG AACTACGATA CCTTCAGGAC GACCTACGCC GTTACGGTAT 600

GCGTATTTAG GGAAGTAAGA TGTATAATCT TCTTCAATTT TAGGTGCTTT TAAGTTATTT 660

TTACAAATGT AATCGTTAAT TGAAGAGTTT ACTTGTGGTT TATATTTTGG CAAACTCGTT 720

TTTGGTGTTG CAGCAACTGA TCTTGGTTGT GCTGAAGCGC TAAAAGTAGT TACTTTAGGT 780

OTCGCTTCAG TTTTAGCTTT AGGTGCTGAT GTAGTTGCAG CTTTAGGTGC TGCGGTTTTA 840

TATTGCGTyT CAAGAGCTGC AGGTTTAGCA GCTGATTTAA X ±£^f% X A- W 900

TTTTCTGAAT TATCATCTTC ATCATCAACT AAACTATAAC AAGJVTTAGTG 960

TTAGTTTTAG GTGCTGTAGT GCTTGTTGAC TTTGCAACAG ^\ X X X V3 X rlXJ X V— 1020

GCTGATTGAT TAGCACGAGT GTCACCATTT ACTTGTGCAG TATCAACTTT TTGACTTACT 1080

TGAGCATTGC CTGTTTTGTT ATTTGCTGTT TTTGGTTGGA CAATAGCAGG GTCTTGATAT 1140

ACTTGAGTGC CAGAAATGTT TTGCGTTGGA TTTTTTACCT CAGCTTTTGC TTGTTCAGTA 1200

GTTGCTTTAA CTTTATTACT ATCTAAAACG TTTTTATTAG TAGTTTGATC TTGTGTCGTC 1260

TCAGCTGCTT GAACTTGATG TGCAGTGACT GCTGAACCTA CAAGCGTTAA TGCAACCATT 1320

GAgGTAGTTT GTAATTGAAT TTTTTCGCCA TTCTATTTAT TACTCCyAAC ATTTATTAAT 1380

TATtACTAAC ATTATAGTAC CTGTnTTATA TACCTGTGCG TA 1422

(2) INFORMATION FOR SEQ ID NO: 306:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6076 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306:

ACGCGTTGGT TATTTCAATC ATAATATTAC TCTGCAAATA CACAGCCCTG TAACCGAGTA 60 

AATGGATAGA GATTTGAACA AATGAAAACA ATCAACTAAT GGAAAGGATA AAATATTATG 120 

CACAACAATA ATGAATTCAA CAAAAAACTG AAAGATTTTA TAGGTAGCGA TAAACGGATG 180 

GCTTTAGTAA AGGGTTATGT TAACGAGTAT AAATTAACAA CTGTTTTAAG AGCTTTAAAT 240 


GTAGATAATC GAATTATTCC TAAAAATATA ACTCAAAACA CAATCTTCAA ATTGAOTAAT 360 

TTAACCTTAG AGGTTAACCT CTATGAAAGA AACACAATTT ATCACGGAGA CTTTGCTATT 420

TACTATCCTG TAGAATCTGC TCTAATAATA GAAACAGATA CTCCTAGGCT AATTAATCAT 480

ATAAATAACA ACCCAGTCCA AAAAATATTT ATAATAACAA CAAATGACTG GAGCTTTAAT 540 

ACACTTGaAT TAGAAAAAAT AGTAGACGAA ACTATAATTT ATGATTTAAA ACAAGAAGAT 600 

CCTAAACAAT ATGAAATTTT ATATAAAAAT AAACACGGAA AATTGCCTTA TTGATGAATT 660 

AAAACGCAAA CTTAGAACAC CTTACCCCTT TTCCGTCCTT ATTGTAATTT TCCAACTACT 720 

CAAATTAAAA ACCACCCGTG CAAACAGGTG GTTTTATTAT ATATAATTAT TCCCACTCAA 780

TTGTGCTTGG TGGTTTTGAT GTAATGTCAT AGACTACGCG GTTGACQTGA TCTACTTCGT 840 

TTACGATAOG ACTAGAAATC TTTTGTAAGA CTTCCCAATC GATGCGTGCG AAGTCACTTG 900 

TCATACCGTC GATAGATGTT ACTGCACGAA TACCTACTGT GTGATCATAC GTACGGTAGT 960

CTCCCATAAC ACCTACTGAT TGAATGTTTG GTAACACTGT GAAGTATTGC CAAATTTCTC 1020 

TTTCAAGACC TTCTTCTCTA ATCACTTGGC GTAAAATCGC GTCTGATTCT CTAACGATTT 1080 

CTAGTTTATC TTCAGTAATT TCTCCAAGTA CACGAATACC AAGACCAGGT CCTGGGAATG 1140 

GTTGTCTCCA TACTAAATGT TCTGGAATAC CTAACTCAAT ACCTAATTTA CGTACTTCAT 1200 

CTTTAAACAA TGTATTGATT GGTTCGATTA ATTCGAATTC CATGTCTTCT GGTAATCCAC 1260 

CAACATTGTG GTGTGATTTG ATTGTTTGTG CTGTTTTAGT ACCTGATTCG ATGACGTCTG 1320 

TATATAGTGT TCCTTGCGCA AGGAAGTCTA CACCTTTCAG TTTTGATGCT TCATCATCAA 1380 

ATACGTATAC AAATTCATTA CCAATGATTT TACGTTTTTG TTCAGGATCT GAAACACCTT 1440

TTAATTTATT CATAAAGCGA TCTTTCGCAT TAACACGAAT AATATTCATG TTGAAACCTT 1500 

CACCGAATTG CTCCATAACC ATGTCGCCTT CACCTTTACG AAGtAAGCCA TGGTCTACAA 1560 

AGATACATGT TAGTTGATCA CCTATTGCTT TATGCAATAG TACAGCTACA ACAGATGAAT 1620

CTACGCCGCC ACTCATCGCA CATAATACAC GACGGTCTCC TACGCGTTGA CGAATCTTTT 1680 

CAATTTCGAT TTCGATAAAG TTTTCCATTG TCCATTGACC TCTACAATCA CAAACACGAC 1740 

GGACAAAATT ATTTAATAAA TCATTACCAT ATTCTGTATG ACGTACTTCT GGATGGAATT 1800

GAACACCATA AATGCGACGT TTCTTATCTT CGATTGCTGC ATAGTCTGTG CTTGGGCTAT 1860 

CAGCGATAAC TTCAAAGCCT TCTGGAATTT CAATAACTTT ATCAGAATGA CTCATCCAAA 1920 

CAGTTTGTTC TGCTGGTAAG CCAGCGAATA ACTCATCTGA CTTCGCATTA ATGATTGCTT 1980 

TACCGTATTC ACGTTCATTG GCACGTTCAA CTTTACCACC TAATAATTTA GTAGTTAATT 2040


TAAATGAACC TTCTTCATAA ACTGAATTTG GACCACCTGA TAAGATAATA CCTTTTGGAT 2160

TCATTTTCTT AATTTCTTCA ATTGAAATTT CATGATCGTG TAATTCACTA TAAACGCCCA 2220

TTTCACGAAT TCGGCGTGTA ATTAATTGGT TGTATTGGCT ACCAAAGTCT AAGACAAGGA 2280

TTAACTCTTG TTCTTTTOCC ATTTCCATAT TTGTCGTTCT CCTTTATCTT AATTAGAATG 2340

AGTAGTTCGG TGATTCTTTC GTAATTTGAA TATTATGTGG ATGGCTTTCT GCTAAACCAG 2400

CAGGACCCAT ACGTGTAAAT TGTGCTTCTT CGCGTAATTC TCTTAAATCG TGTGAACCAG 2460

TATAACCCAT ACCAGCACGC ACACCGCCCA TTAATTGGTA AATTGTATCT TGTAACGCAC 2520

CTTTATAAGC CGTACGTCCT TCGATACCTT CAGGAACAAA TTTCTTAGGC GCTTTGTCCT 2580

CTTGGAAGTA ACGGTCGTTT GAACCTTTTT CCATCGCACC TAAAGAGCCC ATACCACGGT 2640

ATACTTTATA TTGTCTACCT TGGAAAATTT CTGTTGCGCC TGGGCTTTCT TCAGTACCTG 2700

CTAATAAGCT ACCTAACATA ACCGCATGTC CACCAGCAGC TAATGCTTTA ATGATATCTC 2760

CTGAGAATTT AATACCACCA TCAGCAATGA TAGCTTTACC ATGTTTGCGT GCTTCAGTtG 2820

CACAATCATA AATTGCTGTA ATTTGTGGTA CACCAACACC TGCTACAACA CGCGTCGTAC 2880

AAATTGAACC TGGGCCAATA CCAACTTTAA CAATATCTGC ACCCGCTTCA AATAAATCTT 2940

TTGTTGCTTC TGCAGTTGCT ACGTTACCTG CTACTAATGT GATTTCTGGG TAAGTCTTCT 3000

TAATATGTTT CACTTGATCG ATAACACCTT TAGAGTGACC ATGTGCTGTA TCGATAACTA 3060

AGACATCCAC ACCTGCTTCG ACTAATTTTT GAGCACGAAT ATCAGTATCT TTTGAAATAC 3120

CAATTGCTGC GGCTACAAGT AGACGACCAT GTTCATCTTT TGCTGCATTA GGGAATTCGA 3180

TAACTTTTTC AATATCTTTA ATAGTAATAA GACCTTCTAG ACGTCOGTCT TTAACTAATG 3240

GTAACTTTTC AATCTTATGT TTTTGGAGAA TTTTTTCTGC TTCTTCAAGT GTTGTATTCA 3300

CTGrGAGCTGT AATTAAATTt TCTTGCGTCA TTACATCTAC AATTTTAATC GAGAAGTCTT 3360

CAATAAAACG TAAGTCACGG TTTGTTAAAA TACCTACTAA GTTGCGATCT TCTTTATTAT 3420

CAACAATTGG TACACCTGAA ATACGGTATT TACCCATTAA TGCTTCTGCT TCATAAACGC 3480

TTTCTTCTGG CGTTAAGAAA AATGGGTTTG AAATGACACC ATTTTCTGAG CGTTTTACTT 3540

TTTGAACTTC GTCCGCTTGT TCTTCAACGC CCATATTTTT ATGAATAACA CCTAAACCAC 3600

CTTGACGAGC CATAGCAATC GCCATTTTAG ATTCAGTTAC AGTATCCATA CCAGCAGAAA 3660

TAACTGGAAT ATTTAATTTA ACTTTGTCTG ATAATTGTAC GCTTAAATCA ACGTCTTTCG 3720

GTAAAATATC AGATTGTGCT GGAATTAATA ACACATCATC AAACGTTAAT GATTCTTTTG 3780

CAAATTTACT TTCCCACATT AAAAACAGCC TCCATTTTTC AAATTAATTA GTTATATTAT 3840
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GCAGAGATTG CGCCTAAAAC AATTCCGTTT TGAGTCAACC ATGCAAATTG TTCACCTAAA 3 960 

CCTTTAAATG CTTGTGGTAC AGCGCTTATA CCAGTACCTA AtCCTACTGA TACAGCGATA 4020

ATTAATAAAT TGTTTTGATT TTTAAAATCG ATATGTCCTA ATATACTAAC ACCATATGCC 4080 

ATTACCATGC CAAACATAGC TATCATCGCA CCGCCTAACA CAGGTAGCGG TATGATATTT 4140 

GCTAATGCGC CAAGCTTAGG TATACAACCA CATATAAGTA ATAACACGAC CATGCCGTAT 4200 

ATAACATTGT TTTTCTTAGC GCCGGATAAA GAAACAAGTC CTACATTTTG CGAATAGGCT 4260 

GTATACGGAA ATGAATTGAA TATAGAACCT AAcACTATCG CTAGACCTTC CGCAGTATAA 4320 

CCTTTACGAA AATCTTTTCT TTCTAACTTC TTACCGGTAA TTTCACTTAA CGCATGATAG 4380

ACACCTGTCG ACTCAATTAA ACTAACGATA GCTACAATAA AGAACACTAA OGTCGATGTC 4440 

ACATCAAAGC TAAATCCAGA GAATCTAAAC GGCACTGGGA TGCCTAACCA ACCGGCATGA 4500 

TTGACTTGAT TAATATCGAC CATCCCAAGT AAGCCAGCAC CTATCGTTCC TAAAACGAGT 4560

CCAATTAATA TGGCAATACT CTTAATAAAT CCAGTTGTGA ATCTTTGTAA AAGAAGAATA 4620 

ATGATTAATG TCATTAAACC TAACAAAATG TTCTTAACAT CTCCATAGTC CTTTGCACCT 4680 

TGACCTCCAG CTAAGTAATT CATTGCTACT GGCATTAAAT TGATACCAAT GATAGTAACA 4740 

ACACTXCCCG TTACTACTGG TGGGAAGAAT TTTACAAGAT GTGAAAAGAA AGGCGCGATG 4800

ATAATAACTA ATATCCCTGA TAAAAATAGC GAACCATAAA GTACATCTAT TCCTTTCGTT 4860

TGACCAATTA AAATCATGGG CGCAACAGCC GTGAATGTAC ATCCAAGAAC GATTGGTAAT 4920

CCTGTTCCTG TTACTTTATT GGCTTGTAAA AATGTGGCAA CCCCACACAT AAATATATCT 4980 

ACTGTAACTA AGTAAGCGAT TTGTTCAGGT GTAAACTTCA AACTTGTACC AACAATGATT 5040 

GGAACTAAGA TAGCACCTGC GTACATAGCT AAAAGATGTT GAACACTTAG GATTAAATTT 5100 

TTCATTATTC TTCTCCCACC AATGTCACTT TGTTTCCTTC TAGTGAAGCA ACCTTGCAGA 5160 

GAGAAGAAAC TGTTAAACCT GCTTCTTCTA AACGTTGATG CCCATTTTGG AAACTCTTTT 5220

CAACAACAAT ACCAATACCA GCTGTCTTAG CATTCGCTTG CTGTGCGATA TCGTATAATC 5280 

CTAATGAAGC ATCACCATTT GCTAAAAAGT CATCGATGAT AAGTACAGTA TCTTCTTCTG 5340 

ATAAAAACTC TTTTGAAACA ATGACCGTAC TTGTTTTATT TTTAGTAAAT GAATGAATAG 5400

ATGTTTCATA ATAACCATCC GTCAAAGTGC TAGGTTTTGC TTTTTTCGCA AATAAACATG 5460 

GCACATCAAA ATGCAGTGCA GCCATGATTG CAGGTGCGAT ACCGGAAGCT TCAATGGTTA 5520 

AGATTTTAGT AATCCCTTTA TCTTTAAATT GCTCGTAAAA AGTGCGACCA ACTTCATTCA 5580 

TTAACTTTGC ATCAATTTGA TGATTTAAAA ATCCATCGAC TTTTAAAATC TTCTCATCAA 5640 

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10 



ATTTGTGTGA AACATTTTGC TCTTAAATTG GTGCTAGATA CAAAAAAATC CCCAAACTAA 5760 

ATAATAGTTT CAGGGTTTAT GAGTGAACGa ACATGCATAA CGAATTTGTC ATGCAATCAA 5820 

TGTAAAAGAA GTTTCATCAA ATAAACTGTG ACCATCATAT AAAATGATAT AAATCACCCA 5880 

CCATGGTTAC AATTTAATGG CTGAAGCTAC TCCTAGTATT GTGTTGTTAC TCATAGTCAT 5940 

GTCGTTCAAG GCAACATGGT AGAAACTTCT AAAGCCATAT TCTTTAGATT ATATGAGTTT 6000 

ATGTAAATTA TTTAACGATA ATAGCAAATT TTCGGCATTT TTTCAATAAC TGCTTAGGTA 6060 

ATCTTTTAAT AGTTTT 6076 
(2) INFORMATION FOR SEQ ID NO: 307:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6136 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307:

ACCGCCGATT GATAGCTTTA CTGCTGCGAT AAAACTGCCT TCAACAATTG GTGCATCAAC 60 

TTTTAACACA CGATGATTAC CATCATACAT TTCAATTGCC ATATCTACAT TCATTTCTGA 120 

AGATCCAATA TCGTAAAAAC ACAATGCATC ATCCTCTAAT TTAGTCAAAA CTTCTTGGAT 180

GATATCAAAT GAAGTTCCAA TTGAACCATC TGGTAATCCC CCGATTGGTA TAATATCAAC 240 

GTCACCTGCC ATTTGCTTTA ACAAAGATTT TGTACCACTT GcAATTTCTT TACTGTGGCT 300 

AACAAGTATA ATTTTAGGCA TTATCTTCAT CTCCAATCAA AGCGTTTAAA ATATAAACCA 360 

TACTTTGAGC ACCTGGATCA ATATAACCTT TTGATTCTTC TCCAAAATAT GCAGCTCTAC 420 

CTTTCGTTGC TACCATATCT TTTGTATTAT CTGCTAATTG CTGTAAATCA TTGAATGTTA 480

AAGTTTCACC ATTTTTAAGC TTCTCTGCTG CTCGCGCTAC TACATCATAC ATTGTCTTTT 540 

CATTTAAAGT AACTTTACCA CGTGATTCAA CCGCTTCGGC AAATGCCTGA ATTAGTGTAA 600 

TGAAATCTTG ATTATCCATA TCATCTTTGG TGACTGCAGA CATTTTAACA AAGCTAAAGC 660 

CATACAGTGG TCCTGATGCA CCCCCAACAT TTGACATCAA TGCCATACCA GTTGATTTGA 720 

ACAATGATTG CATTGAGCTA TCATCAAGTT TGTCTTTAAG ACTACTAAAC CCACGAACCA 780 

TGTTAACCCC GTGGTCACCA TCACCAATTG CTCGATCTAA TTCAGTTAAT TCAGATTCAT 840 

GTTTTTTAAA CGTTTCTTCT AAATTTAATA AACGTGCTTT CATATCATTC ACTTTCATTT 900 

GTGCCACCTC ATAAATGTAT ATTTATTCAT ATTCACTTCT TATTTAAAGT ATTGACTTGT 960 
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w 





ACCTTGCATG TCTAAAGATG TCATATAATC ACCAACAAAC CATTTAGCAA CATTAACCGT 1080 

TCTTGCAGCT AAATTTTGTT GAATATATTT AGTTACGATA TTTAATTCAG ATAACGGCGT 1140 

ACCACCCATA CCATTTACCA TTAATATGAC ATCATTGGCA GTAACTTCTT TATACAATTC 1200 

GTCTAACAAT GTTCCAACAA TATGATCAAT ATCCTTTACT TCTTCCCTAT GAATACCTTT 1260 

TTCACCATGT ATACCAATAC CGATTTCCAT TTTGTCGTCT TCAATATCAA AGCCATATTT 1320 

TCCAGTAGTT GGAACAAGCG GTGGCTCAAT TGCCATACCA ATACTTTTAA TTTCAGGTAA 1380 

CAACGCTTCT ACACGCGATT TTATCTCTGT TAATGAATAA CCTTTTTCAG CAAGATAACC 1440 

GGCAAGCTTA TGAACAAACA CTGTTCCTGC AACACCACGA CGTTGTACTT CGTTTGTCAC 1500 

AGCAATGTCG TCACGAACAA TAACAGTTTG AACATTTATA CCTTCCATTT CTGCAAGCTC 1560 

TTGTGCCATT TCGAAATTCA TCACGTCACC TGCATAGTTT TTTACAACTA GTAATACACC 1620 

ATCACCAGTA TCTACTGCTT TAATAGCTTC TAATATTTTA TCAGOTGTAG GTGATGTAAA 1680

TACTTCGCCA CAAACCGCTG CATCTAGCAT ACCTTCTGCA ACAAAACCGG CATGCGCAGG 1740 

TTCATGTCCG CTTCCACCTC CAGAGACTAT TGCTACACCA TGTTCTTTCT TAGCTTTTTT 1800 

TACAATAACT GTATTAGCAA TCAGATCTAA CTCTGGGTGC GCAATTAACA ATCCTTCAAG 1860 

CATATCAGTT AAAAATGTTT CTTTTTTATT GATTAACTTT TTCATCATGT TGTACCTCCT 1920 

TGGTATTATC AGTCATAGTA TAAAACAGAA TAAATGAATG CGCTATCATA AAGAATTAAC 1980 

CTGATACCGT TATCAAATAT ATTTTTGATA AGATCCTCTT GATAACTTAA TGGTTCATTA 2040 

TTGAAAAAAT AAATAGTCTT GAGTGGCTCA TTAAATAAAA AATGCTATGA AAAGCCTTAT 2100 

AATAAAGTGC CTTTCATAGC aATAAGTTGT GTCCATTGAC ACTATACATT TTTCGTTTTG 2160 

TACATTAAAT ATAAGAAATA CGGTGCACCA ATAATTGCTA CGACAATACC TGCTGGAACC 2220 

CCACCTGGTT GTAATACAAT TTTGCCAATT GTATCAGCTA TAACAAGTAA ACATGCCCCT 2280 

ACTAAAATGG CAATTGGTAA AAACAACTGG TGACGTGGTC CAACGATACG TTTGGCAATA 2340

TGCGGACCCA TTAATCCGAT AAACGAAATT GAACCTGCTA cTGCTACAGC AGCAGATGAT 2400 

AACATCACTG CGATAAAGAA TAATATTAAA CGTTCTCTGC TTAACCTTAC ACCTAGACCT 2460 

CGTGCAATAT TATCACCCGT ATGAATAATA TTTAGTGTAT TCGATTTAAA TAGTAAGTAA 2520

GGAATAATAA TCAACACCCA CGGTAAAAAT GCAATGACAA ATGGCCATTC GTCACCCCAA 2580 

ATATTACCTG CAAACCAAGC AGCGATGAAA TCAGATTGCT TATCATCAAA TTTTGACATA 2640 

ATTGTAATTG AGCCACCATA TAATGCTGTT TGTAAACCTA CACCTATTAA TACCATACTC 2700 

GCAGGTGTAA CACCTTCATT TTTATTAAAA CTGaAAATAA AAATAATCaA TGCAGTGgTG 2760 



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CCAATTGCAA TAAATAATGC GATTGCAAAT 
CCGCCACtGC GTTAATACCT AATATACCTG 2880

GTTCAGCTAT TGGATTTTTT GTGACACTTT GCACTATTGC ACCACTAATA CTAAgCtGCG 2940

CCAGCCAAAA TAGTAATCAT CATCCGAGGT AACCTGAAAT CTAATAAGAT TAACTCATCA 3000

ATGGCATCAC CTTGTCCAAT TAAAGTTTTG AAAAATCTTT CAACAGGTAT GTTGTATTCA 3060

CCTGAGGTAA TACTCCAAGT ACAACCTAGA AATAGTAGAA TGCTAAAAAC AGCCAGTGCT 3120

ATCAATTGTC TGCGTTTATT ATTTGAACTA ATCATATTGA GCGTCCTCCT TTTTTAACTA 3180

AATATAAAAA GTAAGGAACA CCGATAAATG AAATGATTGC ACCAACAGGC GCTTCTCCTA 3240

AATATCGTGC TATCACATCG GCAACAAGCA CGAGTATCCC ACCTAACAAG GCTGTTAATG 3300

GTAGAATTTT AGCATAATCA GTTCCAATTA AAAATCTTGC TATATGAGGT ACCATCAAAC 3360

CTACAAATGC AACTTGTCCA GCGATAGCAA CTGCAATACC TGCTAGAATC ATAGCAATAA 3420

TTAAACATAT GCCTCTGATC ATTGTTACAT TTTGACCTAA ACCTTTAGCT AATGATTCAC 3480

CAAGATTTAA AATGGTAAGT TGTTTACTAA TTGTTAATAT AATGAATAAC GCAATACCAA 3540

TTAATGGAAT TGCCCACTTA AGGTGTGACC ATGTTGTGCC TGAAACGCCT CCAGCAGTCC 3600

AAAATGTTAC TGTTTGATTT AGTCTAAAAG CTAATGCAAT ACCTTGACTT AGCGCTGTTA 3660

ACATAGCACT TACTGCTGCA CCCGCTAAAA TAATACGCAT CGGATTAAAT CCATCACGTC 3720

TAGATCGGCC TATCATTAAT ACAATAGCAC CTCCTAGAAT AGCACCTAAA AATCCAGCAA 3780

ACATCAATAT TAAAAATGAA GTGTTTGGTA AAACTGCATA TGTTAATGCT AAAGCAAATG 3840

AAGCACCTGA ATTTAAACCT ATGAGCGCCG GATCAGCAAG ACCATTACGA GTAACACCTT 3900

GTATAATCGC ACCAGAAACT GCAAGCGCCA TACCTACAAT TACTGCTGCT ATATTTCTGG 3960

GAATCCTAAT CTCATTGATG ATGTTTTGCT GTTGATTGCT AGGATTATAA TTAAAAATAG 4020

CCTCTATAAT TGTAGAOGCT TGAATTTTGG CGTCACCTAT TAATGTAGAA ATAAATAGTG 4080

TGATTAGTAG TATCATACTT AAACCTATAA TATAGGATAA AAACTTCAAT GGCGTTGGGT 4140

TCTCTCTATT TGTCATGTTA ATTGTCCTTT TTATCATATT AACTTACTTA ATTAAGAATA 4200

AGCTCTGCGA CATAAGTCAT AAGTTACCAG TAAAGGTTTT CCAGTTTTAG GATCTTTACT 4260

TAAAACAACA TCAATATTAA AAACTTTTTC TAATATTTCC TGTGTTAATA CGTCTTCTGT 4320

TGAACCTGTA GCGATGATAT CCCCTTCTTT CATCGCAATA AGATGATCTG AGAAACGAAT 4380

CGCTTGGTTG ATATCATGAA GAACCATGAC AATTGTACAA CCTTGTTCCT GATTTAGCTT 4440

CTGAACTAAT TCTAGTATTT CTAATTGATG ACAGATATCT AAATATGTTG TTGGTTCGTC 4500

TAAAAAGATA ATATCAGTTC TTTGTGCTAA TGCCATTGCA ATCCAAACAC GTTGTCTTTG 4560
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TGCCCAATCA ATtTCTTTCT TAtCCTCAGC AgTTAATCTA CCAAATCCTT TTTGATGTGO 4680 

AAAACGACCA TATGAAACTA ATTCCCGAAC AGTTAAGCCA TCTGCTACTT CaGGTGaTTG 4740 

aGGTAAAaTG GCtATTTTTt TGcAATCyCy TTCGTAGATt GTGtATGAAT ATTTTCACCA 4800

TCTAAAAATA CTTCGCCTTC TTTAACTGCC AATAAACGTG ACAATGCCTT TAGCAAAGTA 4860

GATTTCCCGC AGCCGTTAGG ACCAATGATT GACGTCACTT TGCCATCTGG TATTTCAACA 4920

TCTAATTTAT TTATAATCGT GTTATCCCCG TAACCAATTT TAACTTGTTG TCCATGCAAA 4980

CGATTCATAA TTTCCCTACT TTCAATAAAA TTCTTTCTGT TTATAAAAAA TAATTTCTAT 5040

TTTTAAATTA TCAATTTTCA AAGACATCCC AATTGATAAT GATTATCATG AACATCATTA 5100

TAACATTTTT CAATCTTATT GACTAACATT ACTTTTTAAA TTGGATAGCT CGATTTGTCA 5160 

TGTCTTGTAT ATTACTTTTA TAAAATAAAA AACGCCCACA GATAAGTCTT CATAGTTCAA 5220 

AAACTTGTCC GTGGACTTCT ATTTAAGTAT GTGTGCTCAT ACCATTTATT TATTCATCTG 5280

CAAGAAAGCC ATTACCATAG ACATCTCTTA CATCATGAAT TACGAGGAAT GCATCTTTAT 5340

CGATTTGTTT AATTAATCGC TTTGCTTTTG AAACTTGTGT TTTAGAAATA ACAACGTATA 5400 

AGACATCTTT TTCTTCACGC GTATAATAGC CATGTCCGTT TAAAATGGTT AAACCTCTTC 5460 

CAATTTGCTC GTCTATTGCT TTGGCAAGTT TGTCGGGATT AGTTGAAATA ATCGTCATAG 5520 

CTTTTTTAGT GTTTAAACCT TCTATGACAT ATTCCATCAC TTTTGTTCCT ATATAAAGTG 5580 

ATATTACTGT TACTAATACT TTATCAAGTG GAATAACTGT AAGTGAAATT GCAACAACGA 5640 

TCATATCGAA GAAAAGCAAA GCATATGGCG TGCTTACATC GAGGTATTTC GTTGCAATTC 5700 

TCGCCAAAAT TGTTGTACCT GCTGTTGTAC CGCCTGCAAG GATAATTACT CCGATTCCTA 5760

GTCCAACGCT TACACCACCA AAAATGGCAT TCACAATGCT GTTTCCAGTT TCTACTTGCC 5820 

ATGATTCTGT TAAACTCAAA AATATTGAAA TAAGAATTGT TACAAGAATA GTTAAGTACA 5880 

TACTTCTCTT ACTCAAAAAt TTATAAcCTA TGGCAATCAA TACTGCGTTG ACCAAGAAGT 5940

TAGTGATGGC TGGTGAAATA TGAAACGCAT AATATAAAAT AATTGCTAAA CCTGTAACCC 6000 

CGCCTTCACC TAAGTTACCA GAAAT^ATAA ATGCATTTAC ACCTGCAGCA AAGATAAATG 6060 

AACCTAAGAC AACTAGTATT AAATCTTTAA CCGTTTTATT CACGAAACCA TCCCCTTTAT 6120

ATATTTATTA GACTAT 6136 
(2) INFORMATION FOR SEQ ID NO: 308: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2576 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308:

GATATCGTAs CTACTGAAAA GTCATCACCA CCATGGAATG ATTTCTTTAA ACGTTTTAGC 60

TTCTATGCAA TTGCAATTCA ATACTTTGTT GTACAATTTA TCATTACATT ATTCTTAATT 120

TGGTTACCGA CGTATTTAAC AGAAGTATTC CACGTTAACT TTAAAGAAAT GAGCATTAGT 180

TCATTACCTT GGTTATTAAT GTTCTTCTTA ATCTTATCAG CAGGTGCAAT TTCTGACCGT 240

GTATTAGGAT TAGGTCGTTC AAAATTCGTA GCTAGAGGTG TAATTGCAAT TGCAGGATTT 300

ATTGTGTTTG CAGTTTCAAT TATCTTTGCT GTACGCACAG GAAATTTATA TGTAAGTATT 360

TTCTGGTTAT CACTAGGTCT TGGTGGTATC GGTATTTCAA TGGGTATGAG TTGGGCTGCA 420

GCAACTGACT TAGOACGTAA CTTCTCTGGT ACAGTATCAG GGTGGATGAA CTTATGGGGT 480

AATATAGGTG CATTAATCAG TCCGCTATTA GCAGGTCTAT TCGTAGAACA TTTGGGTTGG 540

ACAAXX3ACAT TCCAATTGTT AATCGTTCCA GCAGTAATCG CTGTGATTAT GTGGTTCTAT 600

GTGAAACCAG ATCAACCTTT AATTGTTAGT GATGATAAAG CAATAGAAAA ATAATTTAAA 660

CAAGCAGTAA GCTTTCACAT AGTTGGGGCT TATTGCTTTT TTTGCGTTGA AATTGAAACT 720

TTTTAAAACA GATATGGTTT AAGATGAAAA TGAAGTTATT GAAATGATAT ATGTAAAGAA 780

ATAAGGTTTT AAAACATTAG TCAGGTAACG CTTGTaAAAG TACATATAAA TTTTAACTAG 840

CGCAAAGGTG GGCGACCAAA GtTcAACGAT GTTAAATAAC aTTAGrAATT AATTTTAATT 900

GGACTTTAAA AGTTTTTAAA TTTAGATAAT TGAGCATAAG GTGTTATAAT GACATATGTT 960

GCGTAATTAA AATTTATAGC AACAAATTCA TTTTAACTAT GCTAATAAAA AGATTATGGA 1020

AATATTTTGA CAAGGAAAGG AGAAGTCGAA ATGACATCTT TTTGACATCA CTCATAAAAA 1080

TCAACCGACT TAACTTAGAC TTTTATAAAG GTGTAAGACA GGGACTGTTA ATGATTATTC 1140

CTGCAATAAT CGGTTACTTA TGTGGTAATT TCCAATTTGG ATTATTAGTT GCAACCGGAA 1200

CACTAGCCCA TATTTATGTT TTTAAAGGTC CGTCGCGATC TAAGCTGCGA ACTGTAATAA 1260

TTTGTAATTT AGCGTTTGCA ATATGTATGA TGCTTGGTAC GCTAACAGCC AAAACGCCAC 1320

TCGTTTTTGG AATGACATTA TTAATTGTTA CGGTTATACC ATTTTATATA TTTACTGCCT 1380

TAAAAATAGC TGGACCGTCA TCGACATTCT TCATTGTGAC ATTCAGTCTA CCCATTAACT 1440

TACCTATAGC TCCCGAAGAA GCATTATATA GAGGCTTTGC GATTTTAGTA GGCGGTATAC 1500

TTGCCACTAT GATGGTGTTA ATCACGATCG TATTTTCTAA AAACAAAGCT GAAGAACAAG 1560

CAATTCAAAA TGATTTTAAA CTCATATCTA AGTTGTTACA CACTTATAAT GATAAATCTG 1620
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TCACTTCTAC TTCAAGTAAC GATAAATTAA GTAGACGTTT CCAAAAATTA TTATTATTAC 1740

ACACATCTGC CCAAGGGATT TATTCTGAAC TGTTAGAGTT GAACGCTAAA CAAATTCGAC 1800

CATTGCCAGA TGAGTTAATT GAAATGATGG ATCATATCAT TGCACAACTA GATAATAGTG 1860

AGGAAAATGT AAGATATTGG CGAAAAGAAG TGACAGTAAC AGAGGAATTT CAAAATTTAT 1920

TCAACCATAT ATTGAAAATT GATGAAATGG TGCATGCAAA TGAAGCGCGT ATTGCGTATG 1980

AAGCAGACAT GCGAAAACCT TTATATAGTA AACGCATTTA TCcLAAATTTA ACaTTAGACT 2040

CtATkGTTTT TAGAAATACA TTGAGATATA CAGCGATTAT GATGATAGCG ATATTTATTG 2100

CGTTAATGTT TGATTTTGAA AAAGCATACT GGATACCGTT ATCTGCACAT ACAATATTAC 2160

TAGGAACATC AACTATACAT GCAATCGAGA GAGGTATGGC ACGAGGTTTA GGTACTATTT 2220

TAGGTGTGTT AGTACTTTCA GTCATATTGT TGTTTTCAAT ACCAACACCT GTTGCAGTAA 2280

TTTTAATGGG CATTGCAGCA TTGTTTACTG AAGCATTGGT GGGAGCAAAT TATGCGATTG 2340

CAGTAGTTTT TATTACAATA CAAGTTATTT TAATGAACGG ATTAGCATCA CAGAATTTAA 2400

CAATTAACAT TGCGTTTCCA AGAGTTATTG ACGTTGCAAT GGGTATTGTG ATTGCAATCA 2460

TAGGTTTATT TGTCCTTGGA CAACGTACCG CATCCGCATT GCTTCCTAAT GTAATGGCTG 2520

AAGTTGTTCG TAAAGAAGCA ACGCTCTTTC ATTATTTATT TTCTGAAAAT CAATAT 2576

(2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309:

CAAGCACATT ACGATGGTCT AATACTTTGC TAATAATTTC TTCTTGTCCT GGTCGAAACG 60 

TTTCATACCC AAAGTAATGC GATAATGTTT GTTGCATCAT AAATTGACCC CTTATTGTTG 120 

TTCTTTAATT TCTTCTAACT CACTCCATCT TGTGATGTCT AAATCATATT GAATTTCAAG 180 

TTGTTCTTTT TCTTCGTTTA ATTCTTTAAT TTTCCCATAA TCTGCACTTG CCTCAATCAT 240 

GAGCACATCA ATTTCTTCCA TTCTTACTTC CGCTTGTTCT ATGCGTTTCA TCAATTGTTC 300 

ATATTCTAAT TTTTCTTTAT ATGATAAACC ATTTTTCTTA CGTACAGTTG TAGAAGATTT 360 

AGATTGTTGC TTCAATGTGG ATTTATTTTT ATCTAATGAT TTTTTATAAC TTTCATAATC 420 

TTCAAAAGTT CCGATAATCT TTTCCATCTG ACCATCATGA ATAAACCAAT ATGACTGTGC 480 
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AATATAATCT TCAAGTATTG TTAAAGTCTC AGTATCTAAA TCATTTGTCG GTTCATCTAA 600 
CAACAGAACA TTTGGCTGGT GTAC6AGTAG ACGTAATAAA TACAAACGCT TTTGCTCTCC 660 

ACCAGATA 668 

(2) INFORMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12173 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310:

CATCAGTTTA TTTTGAAAGG CAATGCGATC ATTTCATGTA TTTATGTTGT TTGAAACATC 60

GTTAGATAAC AATAGTGATA TTGCACATTT TAAGCTGAAG ATGGAAAAAT ATTCGATAAT 120

CAAATAAACA ATGAATTTTA GAAGGTACAA TGACGTTCAC TAATTTAAAT ATAGCTGAAT 180

GTGTTGGTGA GTGATGTTCA CTATAGATTT ATATTAATAT ACAAAGACAA AGGTTGTTAA 240

TTTTTATTAA GCGTTAGGTT GAATGTATGA GAATTTTAGA TTTATAATAG AAGATAGAAA 300

OGAAAATTTT TCTTAAAAGC AGTAATGTTG ACTCAAAATA AGCTATAATA ATGACACTTA 360

TTTAATTGAT TAACATTTGC TAATAAATAT CAATATAGAA TATAACTTTC CAATAATGAC 420

TGAGAAAATC GAAATGTCAG TCTCGAATCA TATAATTAGA AAATTGATTA TTTTCTGTCA 480

ATTTAGGGTT GAACTATACA TATGATATTG TTAGAATATT TTTTAACATT ATATTTTATT 540

GCTTTAAAGT GGAATATACT TGAAATAATT AGTAGAGGTG AGTAAGGATG AGTAATAAAT 600

TAGAATCATA CAGAAGTGAG ATTGTATCAC TGAATCATCA AATTTTAGAC TTATTATCTA 660

AACqTGGTGA ACTAGCACAA AAAATTGGGG AAGAAAAATT AAAACAAGGT ACACGTATTT 720

ATGATCCACA ACGTGAAAAA GAAATGCTTA ACGACTTAAT CGATAGTAAC AAAGGACCAT 780

TCAACGATAA TACTATTAAG CAATTATTTA AAGAAATTTT CAAAGCCTCT ACAGATTTAC 840

AAAAATCTGA AAATGAAAAA CATTTATATG TATCACGTAA GTTGAAACCT GAAGATACGA 900

TTGTAACATT TGATAATGGG GGCATTATAG GAGACGGCAA TAAATCATTT GTATTTGGGC 960

CATGTTCAGT TGAATCATTT GAACAAGTTG AAGCTGTTGC TAAAAACTTA CATGCTAAAG 1020

GTGAAAAATT TATTCGTGGC GGTGCATTTA AACCACGTAC ATCACCATAT GATTTCCAAG 1080

GCCTAGGTGT TGAAGGACTT AAAATACTTA AACAGATTAA AGATAAATAT GATTTAAATG 1140

TTGTCAGCGA AATCGTAAAT CCAAATGATT TTGAAGTGGC TGATGAGTAT TTAGACGTAT 1200



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



AAAAGCCTAT TCTATTAAAA CGTGGTTTAT CTGCTACAAT CGAAGAGTTT GTTTATQCAG 1320 

CTGAATACAT TGCTTCACAA GGTAATCAAA ACATTATTTT ATGTGAACGT GGAATCCGAA 1380

CTTATGAAAA GGCGACACGT AACACTTTAG ATATTTCAGC AGTACCAATT TTAAAACAAG 1440 

GTACACACTT ACCAGTCATG GTAGATGTTA CGCATAGTAC AGGTCGTAAA GATATCATGT 1500 

TACCAACTGC GAArsAgCAT TAGCAGTTGG TGCTGATGGA GTTATGGCTG AGGTGCATCC 1560 

AGATCCATCT GTTGCACTTA GTGATGCGGG TCAACAAATG GATTTAGATG AATTCCAAGC 1620 

ATTTTATGAT GAATTAAAGC CTTTAGCTGA TTTATATAAC GCTAAAAAGT TAAAATAATA 1680 

TTCCAAGGAA ACTATAGACT ACTTAACTAA TATGTCATGT TGAAGTAGAA TATTATCTTT 1740 

GAATCGACAA TTTTAAACTT ACAGCCATTC TAAGAGTATA TTACTTTTAG AGTGGCTATT 1800 

ATTTTTTGTA TAGAAATAAA GGTATACTGC ACTTAACGAT TGTTATAATA CTTCGACACT 1860

TGTTCAATTT CACAATTATT AAAGATTATG ACTGATAGCA GTAATTAAAA TTATAACTAT 1920

GAATTATCTG TAAAATATAA TAGATTCAGA CATTTGTTGC TGAAATGTGA ACATTTTTCA 1980 

ACAAATGCAA TTGATATTTG AAAAGGCTTT CTCAAAACAT TACAATTAAA AATGAAAAAA 2040 

GTTTATATAA AATTAAAATA TATCGTTCGT TATCATTTAG CGTTTGTTTT TATTTCAAGC 2100

TTTTCGCTAA ATTTTTCCAA ACAAAAATAT GTTACTGTAA ATTAAAATAT GGTAAACTAT 2160 

GAAAATGAAA TGAAAACATG TTATTATAAT GAATAAAACG TTTACAAGGA GGAAATTATG 2220 

ACAGTTACTA TATATGATGT AGCAAGAGAA GCGCGTGTCT CTATGGCCAC AGTGTCGCGT 2280 

GTTGTTAATG GGAACCAAAA TGTTAAAGCA GAAACTAAAA ATAAAGTTAA CGAAGTCATT 2340 

AAGCGTTTGA ATTATCGTCC AAATGCTGTT GCTAGAGGTT TAGCTAGTAA AAAGACAACA 2400

ACAGTAGGTG TGATCATTCC AGATATATCT AATATCTATT ATTCACAACT TGCTCGTGGA 2460

CTTGAAGATA TTGCAACAAT GTATAAATAT CACTCAATTA TTTCAAATTC AGATAACGAT 2520 

CCTGAAAAGG AAAAAGAAAT TTTTAATAAC TTATTAAGTA AACAGGTTGA TGGTATTATT 2580 

TTCCTTGGTG GTACAATTAC TGAAGAAATG AAAGAATTGA TAAATCAATC ATCTGTACCT 2640 

GTAGTAGTAT CAGGAACAAA TGGTAAGGAT GCACATATAG CATCAGTTAA TATTGATTTT 2700 

ACTGAAGCTG CGAAAGAAAT TACGGGAGAA TTAATTGAAA AAGGCGCTAA ATCATTTGCT 2760

TTAGTAGGTG GAGAACATTC TAAAAAAGCT CAAGAAGATG TTTTAGAAGG TTTAACTGAA 2820 

GTGTTAAATA AAAATGGCCT TCAATTAGGT GATACATTGA ATTGTTCTGG TGCTGAAAGT 2880 

TATAAAGAAG GCGTAAAAGC TTTTGCCAAA ATGAAAGGCA ATTTGCCAGA TGCCATTTTA 2940 

TGTATCAGCG ACGAAGAAGC AATTGGTATT ATGCATAGTG CAATGGATGC TGGTATTAAA 3000 
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CCACAACTTT CTAGTGTTAT TCAACCATTA TATGATATCG GTGCAGTAGG GATGCGCTTA 3120 

TTAACAAAAT ATATGAACGA TGAAAAOATA GAAGAACCAA ATGTAGTTTT ACCTCACAGA 3180 

ATTGAATACC GAGGAACTAC AAAATAAATT CACAAAATTA GGCATTCATC TAACGACCCA 3240

AATTATATGG GTGTTGGAAG AATGCCTTTT ATTTATCTTT TAAAATCGTT GCAGATTAGG 3300 

TTACTTATTG ACGAGTAGAT TCGTACCAAC TCGCTATATG TAAAGCTAAT TTTTTATTTT 3360 

TTTCACTAAT TTCTTTTGTG CGGGGGACAT AGGTATAATC ATTTAAACGA TCTTCCCATC 3420 

TTTTAGGTAA TAATTCAGAT GAATAATGTT TCCATTTATT AATCCATTCT AACGGTAAAT 3480 

AACCACTTTG AATTGGTTGA TCAATTAAAC TTAAGAATAC ATGACTCCAT GCACGTGGTA 3540

OGACTCTCCA AATATTGTAG CCTCCGCCAC CAAACATAAT TACCTTTCCA TTCGTATAAG 3600 

AATCAGCTAA ATATTTTACA AAATATGGAA TTTCATATAA TGAATGTAAC GTACAATTTA 3660 

GATGAGTTAG TGGATCACGA TAATGTATAT CGACACCATT TACGCTTAGA ATAATATCAG 3720

GTTTAAAACT CTTTACGACA GGCTCAACTG TTAATTTAAA ACACTCCAAA AATGATQCAT 3780

CTTCTGTATA CGGTTCAAGT GGGACATTTA CAGTGTGTCC ATAGCCGATA TCTTCACCGC 3840

GCTCAGTATA GTGACCAGAG CCTGGGAAAA GAAATTTTCC GGTTTCATGG ATAGAATAAG 3900

TAGTAACATG GTTATCGGCA TAGAAACTCC ATTGTGTACC ATCTCCATGA TGTGCATCGG 3960

TATCTATGAT TAAAACGCGT TGATTGTATT CTTTAGCTAA GTATTGTGCG GTAATTGCAA 4020

TATCATTGTA TATACAAAAA CCACTTGCTC GACCAGGTTG AGCGTGATGC AAACCACCAC 4080

CTAAGTGACA ACCATTTAAT ACTTTGCCTG ACATAATAAG ATCTGCTAAA GTTAAAGCGC 4140 

CTCCAACAAT TGTGGCACTA TGGCGGTGCA TATGCTTAAA TTGACCATTC TCTTCATCAT 4200 

TTAATCCATA TTTCTTAGCC TCATCTTCAC TGATAATGCC ATGTGAAGCA TGCTTAATAG 4260 

CTTCGACGTA ATCATATTTA TGAATTAACA TTAATTCGTC ATCTGTTGCA ATTCTAGGTT 4320 

GTACTATTTG TTCTGGAGAC AATAAATTTG CATTCAAAAG TAGCTCTGTT GTTAATTTTA 4380

AACGCATTTG ATTGAAGGGA TGTTGGTCAT GAAATCGATA TTGTAATAAC TTATCTGAAT 4440 

AAACATATGC AGTTTTTGAT GAATGTTGTT GCATATAATC CCTCCGATAT TCCAAAAATT 4500 

AAAAGAAAAA CCGATTCATA TAACGAATAT CATCAAACGC TTGTTGCTGT TCTAATGTAA 4560

TGTTTTTGCC AATTCTTGCC ATTAAACAAT TAGCTGGATG ACTTGTTATT TCTGGATCAT 4620 

CTGTAGCGAA TATTTCAAGT CCACCAGTTG CCATTAACCG CTGCATTAAT TTTTTATAGT 4680 

CAAATACATC TAACTTTGAA TTTTTTAAAT CCCAATGCCA GTAATATTCT GTAGTTATAA 4740 

CGATATAATT CTCGAATTCT GGTGTAGAAA GGCTAAGTTG TATCAGCTTT TCTGCAAGTT 4800 
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TACCTGTAGA CCAGCGCTCA ATTTCATCAG GATAGTGGAA AGTGACATAA CCCACAATGA 4920 

GTTGATCTTG GCGAGCAACA TAAATTCTAC CTTCTGGTAA TGTTGTAATT TCTAACAAAG 4980 

CTTTATACTG ATCTTCAGCA TCTCTAAATG CGGTTAAATG CGCATCGAAA GTAAGCGCTT 5040

TCAAATCTtC GTGTGTTAAA GGACCTTCAA TAACAAATTG CTTGTCATGA ATGTAATAAT 5100

CTTCGGATTG ATACGTCTTT AAATGATTCA TATTTTCAAC TCCTCAATCG ACGTCGTGAT 5160 

TGTATTAAAT TCATTATATA GAAaATTTAC AATAATTAAT ACTAGAAAAA GGATAAAAGT 5220 

AAAAATTTTG AATAATTAGA AATGTTATGT ATAATATTGA GAAAGAAAGC GTTTTCACAT 5280 

AACAAAGGGG GAGTTTCAAA TGAAAGTCGA AGTTTATAAA GGAGCGCAAG GTAAACATAA 5340

CCTTAAAGAT TATGAAGAAA CATATAATAC TTTTGATTGG AAAGACGTAG AACAAGCATT 5400 

TTCTTGGAGT GAAACTGGAA AAATGAACAT GGCATATGAA TGCATAGATC GCCATGTAGA 5460 

20 TCAAGGATTA GGGGATAAAA TAGCGTTAAA TTACAAAGAT GAGCACAGAA AAGAATCGTA 5520 

TACTTATAAA GATATGCAAC GGTTATCTAA TAAAGCAGCG AATGTTTTGT CTGAACATGC 5580 

AGAAGTTGAC AAAGGTGACA GAGTATTTAT ATTTATGTCG CGTACACCTG AACTATATTT 5640 

25 TGCGTTGTTA GGTGTTTTAA AAATTGGTGC AATTGTTGGG CCGTTATTTG AAGCATTTAT 5700 

GGAAAAGGCA GTTGCGGATA GATTAGAGAA CAGTGAAGCT AAAGTGTTAA TTACTAATAA 5760 

GGCATTGTTA CCTCGAGTAC CTGTAGATAA ATTACCAAAC TTGAAAAAAA TTGTTGTCGT 5820 

AGATGAGGAT GTAGAAGAGA ATTACATAGA CTTCATTAGT TTGATGGAAA CTGCTAGCGA 5880 

TGAATTTGAC ATTGAATGGT TAAAGTCGGA TGATGGTTTG ATTTTACATT ATACATCAGG 5940 

35 TTCTACTGGG CAACCTAAAG GtGTATTGCA TGTTCAACAA GCAATGTTAG TGCACTATAT 6000 

TTCTGGAAAA TATGTATTAG ATTTACAAGA AGATGATGTT TATTGGTGTA CAGCAGATCC 6060 

AGGTTGGGTT aCAGGAACAT CTTATGGTAT TTTTGCACCA TGGTTAAATG GCGCTACAAA 6120

TTGTATAGCT GGTGGTCGCT TTTCGCCAGA ACAGTGGTAT AGTATGATTG AAGATTTTAA 6180

AGTGACGATT TGGTATACGG CACCAACAGC TTTAAGAATG TTAATGAGTG CTGGTGACGA 6240 

TATTGTTGAG AAATATGACT TGTCATCGTT ACGTTCGATT CTATCAGTAG GTGAGCCTTT 6300 

45 AAATCCTGAA GTTATAAAAT GGGCGAAAAA AGTATACGGT TTAACGGTGT TAGATACTTG 6360 

GTGGATGACA GAAACAGGTG GACATATGAT TGTTAACTAT CCAACOATGG ACGTcAAGCT 6420 

TGGCTCAATG GGCAAACCAT TACCTGGTAT TCAAGCTGCA ATTATCGATG ATGCAGGGAA 6480

TGAATTACCA CCAAATCGAA TGGGCAACCT TGCTATAAAA AAAGGCTGGC CATCAATGAT 6540

GTATCGTATC TGGAAGAATC CAGAAAAATA TAAATCATAT TTTATTGGAG ACTGGTATGT 6600 
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TO 



TGATGTAATT ATGACAGCTG GTGAACGAGT TGGACCATTT GAGGTTGAGT CTAAATTGGT 6720 

TGAACACGAA GCAGTTGCCG AAGCAGGAAT TATTGGTAAA CCTGATCCGG TTCGCGGTGA 6780 

AATAATTAAG GCGTTTGTTG CACTGAGAAA AGGATATGAA CCAACAGAOG AATTAAAAGA 6840 

AGAAATTCGT ATATTTGTTA AAGAAGGTTT GTCGGCACAT GCAGCACCAC GTGAAATCGA 6900 

ATTTAAAGAT AAATTACCTA AAACACGGTC AGGTAAAATT ATGAGACGTG TATTAAAAGC 6960 

TTGGGAATTA AATTTAGATG CTGGGGATTT AAGTACAATG GAATAATGAC ATGAATGTTA 7020 

TTGAAGATTT TTTTCGAAGA ATAAAGGGTG ACAACATATT TCATGTCAAT GTTTAAATAA 7080 

1S TCGTTTACTT TACGATAAGC AATATAAAGA ACTGTTAACT TGTGTCATAT CATTTCGTAG 7140 

AAAGCATTTG AAAATGATGA CATAACAATA ATGGCATATC TTTATATTGC TTTTTATTTT 7200 

TAATATGATC TTTGGAAGAT GATTATTTTA AATAATAGAA AAATATAGTT ATCAATAGTA 7260 

TCAAGCGCTA AAAGTTGTAT AATACAAAAC TTTAATAAGT GAATTTATTG CAAAAATGAA 7320

AGCGCTAACC CGATTTAGTC GACAAGTTTT TAACAGTTCG TTATTATATG AATGTAAGTA 7380

AAAATTTCTT AGCTACAACT TACATATTAT AAATGCATAA ATTAAACAAA AAGGGGCGAA 7440 

AAAAGTTGAC TCATTTATCA GATTTAGATA TTGCGAATCA ATCAACACTA CAACCAATTA 7500 

AGGATATTGC TGCATCAGTA GGTATTTCAG AGGATGCATT AGAACCTTAT GGTCATTACA 7560

AAGCTAAAAT CGACATTAAT AAAATTACGC CAAGAGAAAA CAAAGGGAAA GTTGTTTTAG 7620 

TAACTGCGAT GAGCCCAACA CCAGCTGGTG AAGGTAAATC AACGGTTACA GTTGGTTTAG 7680 

CTGATGCATT CCATGAGTTA AATAAAAACG TTATGGTTGC ATTAAGAGAG CCTGCTTTAG 7740 

GACCAACATT TGGTATCAAA GGTGGTGCGA CTGGTGGTGG TTATGCGCAA GTCTTACCTA 7800

TGGAAGATAT CAACTTACAT TTCAACGGAG ATTTCCATGC GATTAGAACT GCAAATAATG 7860 

CATXSTCTGC GTTTATCGAT AATCATATTC ACCAAGGTAA CGAATTAGGA ATCGATCAAA 7920 

GACGTATTGA GTGGAAACGT GTATTAGATA TGAATGATCG TGCACTTAGA CATGTAAACG 7980

TTGGGTTAGG TGGACCTACA AATGGTGTAC CACGTGAAGA TGGCTTTAAT ATTACAGTAG 8040 

CGTCTGAAAT TATGGCGATT TTATGTTTAA GTAGAAGTAT TAAAGACTTA AAAGATAAAA 8100 

TTAGTCGTAT TACTATTGGT TACACTAGAG ATCGCAAGCC AGTTACAGTT GCAGATTTAA 8160 

AAGTGGAAGG TGCACTTGCA ATGATTTTAA AAGATGCAAT AAAACCAAAC TTAGTACAAT 8220 

CAATTGAAGG GACACCTGCA TTAGTTCATG GTGGACCATT TGCGAATATC GCACACGGTT 8280 

GTAACTCAAT TTTAGCAACT GAAACAGCAC GTGATTTAGC TGATATCGTT GTAACGGAAG 8340 

CTGGATTTGG TTCAGACTTA GGCGCTGAAA AATTCATGGA CATTAAAGCG CGTGAAGCAG 8400 

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GTGTAGCGAA AGATAATTTA AAAGAAGAAA ATGTAGAAGC AGTAAAAGCA GGAATTGTTA 8520 

ATTTAGAGCG TCATGTTAAT AATATTAAAA AATTCGGTGT AGAACCGGTT GTTGCAATTA 8580

ATGCATTTAT ACATGATACC GATGCAGAAG TAGAATATGT AAAATCTTGG GCTAAAGAAA 8640 

ATAACGTACG AATTGCCTTA ACTGAAGTTT GGGAAAAAGG TGGTAAAGGT GGCGTTGACT 8700 

TAGCAAATGA AGTATTAGAA GTCATTGATC AACCTAATTC ATTTAAACCT TTATATGAAT 8760 

TAGAATTACC ATTAGAGCAA AAGATTGAAA AGATTGTGAC TGAAATCTAT GGCGGTTCAA 8820 

AAGTAACGTT TAGCAGTAAA GCGCAAAAAC AATTAAAACA ATTTAAAGAA AATGGTTGGG 8880 

ATAATTACCC AGTATGTATG GCGAAAACAC AATATTCATT CTCAGATGAT CAAACGTTGT 8940

TAGGTGCACC ATCAGGATTT GAAATTACAA TTCGTGAATT AGAAGCGAAA ACAGGTGCAG 9000 

GATTTATCGT AGCGTTGACA GGTGCAATCA TGACTATGCC TGGTTTACCT AAAAAACCAG 9060 

CAGCATTAAA CATGGATGTT ACTGATGATG GTCATGCAAT TGGGTTATTC TAATAAATCA 9120

TGTCAATTGT TTAATAAAGA TAAGTAAATA GTTTAATAGA CCGGACTGTT GGAGATGCAT 9180

TATTTCAGCA GTTCGGTTTT TTGCTGTGCT AAAAATAGAT TCAATTTGGC GAATCTAACG 9240

ACAATGTTTG AAGGTGGTTA ATTAATGTAT ATGAAGATAA AAAGTGGGCT TGAAGAATAG 9300

GAAAGCGATG CAATGAATAT TCCATATTAA AAAAAATTAA TAAAATAGGT TGCAATATTT 9360

AATTGGGATG CGCTACAATT AACACTAATA ATTGATATTG ATAATTATTA TCAATTAAAT 9420

ATAATCTTAT AGGAGTTGTT AACAACATGA ACAAACATCA CCCAAAATTA AGGTCTTTCT 9480

ATTCTATTAG AAAATCAACT CTAGGCGTTG CATCX3GTCAT TGTCAGTACA CTATTTTTAA 9540

TTACTTCTCA ACATCAAGCA CAAGCAGCAG AAAATACAAA TACTTCAGAT AAAATCTCGG 9600 

AAAATCAAAA TAATAATGCA ACTACAACTC AGCCACCTAA GGATACAAAT CAAACACAAC 9660 

CTGctACGCA ACCAGCAAAC ACTGCGAAAA ACTATCCTGC AGCGGATGAA TCACTTAAAG 9720 

40 ATGCAATTAA AGATCCTGCA TTAGAAAATA AAGAACATGA TATAGGTCCA AGAGAACAAG 9780 

TCAATTTCCA GTTATTAGAT AAAAACAATG AAACGCAGTA CTATCACTTT TTCAGCATCA 9840 

AAGATCCAGC AGATGTGTAT TACACTAAAA AGAAAGCAGA AGTTGAATTA GACATCAATA 9900 

45 CTGCTTCAAC ATGGAAGAAG TTTGAAGTCT ATGAAAACAA TCAAAAATTG CCAGTGAGAC 9960 

TTGTATCATA TAGTCCTGTA CCAGAAGACC ATGCCTATAT TCGATTCCCA GTTTCAGATG 10020 

GCACACAAGA ATTGAAAATT GTTTCTTCGA CTCAAATTGA TGATGGAGAA GAAACAAATT 10080 

ATGATTATAC TAAATTAGTA TTTGCTAAAC CTATTTATAA CGATCCTTCA CTTGTAAAAT 10140

CAGATACAAA TGATGCAGTA GTAACGAATG ATCAATCAAG TTCAGTCGCA AGTAATCAAA 10200 
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AGGCAACGAC CAATATGAGT CAACCTGCAC AACCAAAATC GTCAACGAAT GCAGATCAAG 10320 

CGTCAAGCCA ACCAGCTCAT GAAACAAATT CTAATGGTAA TACTAACGAT AAAACGAATG 10380 

5 AGTCAAGTAA TCAGTCGGAT GTTAATCAAC AGTATCCACC AGCAGATGAA TCACTACAAG 10440 

ATGCAATTAA AAACCCGGCT ATCATCGATA AaGAACATAC AGCTGATAAT TGGCGACCAA 10500 

TTGATTTTCA AATGAAAAAT GATAAAGGTG AAAGACAGTT CTATCATTAT GCTAGTACTG 10560 

TTGAACCAGC AACTGTCATT TTTACAAAAA CAGOACCAAT AATTGAATTA GGTTTAAAGA 10620 

CAGCTTCAAC ATGGAAGAAA TTTGAAGTTT ATGAAGGTGA CAAAAAGTTA CCAGTCGAAT 10680 

15 TAGTATCATA TGATTCTGAT AAAGATTATG CCTATATTCG TTTCCCAGTA TCTAATGGTA 10740 

CGAGAGAAGT TAAAATTGTG TCATCTATTG AATATGGTGA GAACATCCAT GAAGACTATG 10800 

ATTATACGCT AATGGTCTTT GCACAGCCTA TTACTAATAA CCCAGACGAC TATGTGGATG 10860 

20 AAGAAACATA CAATTTACAA AAATTATTAG CTCCGTATCA GAAAGCTAAA ACGTTAGAAA 10920 

GACAAGTTTA TGAATTAGAA AAATTACAAG AGAAATTGCC AGAAAAATAT AAGGCGGAAT 10980 

ATAAAAAGAA ATTAGATCAA ACTAGAGTAG AGTTAGCTGA TCAAGTTAAA TCAGCAGTGA 11040 

CGGAATTTGA AAATGTtACA CCTACAAATG ATCAATTAAC AGATTTACAA GAAGCGCATT 11100

TTGTTGTTTT TGAAAGTGAA GAAAATAGTG AGTCAGTTAT GGACGGCTTT GTTGAACATC 11160

CATTCTATAC AGCAACTTTA AATGGTCAAA AATATGTAGT GATGAAAACA AAGGATGACA 11220

GTTACTGGAA AGATTTAATT GTAGAAGGTA AACGTGTCAC TACTGTTTCT AAAGATCCTA 11280

AAAATAATTC TAGAACGCTG ATTTTCCCAT ATATACCTGA CAAAGCAGTT TACAATGCGA 11340

TTGTTAAAGT CGTTGTGGCA AACATTGGTT ATGAAGGTCA ATATCATGTC AGAATTATAA 11400

ATCAGGATAT CAATACAAAA GATGATGATA CATCACAAAA TAACACGAGT GAACCGCTAA 11460

ATGTACAAAC AGGACAAGAA GGTAAGGTTG CTGATACAGA TGTAGCTGAA AATAGCAGCA 11520 

40 CTGCAACAAA TCCTAAAGAT GCGTCTGATA AAGCAGATGT GATAGAACCA GAGTCTGACG 11580 

TGGTTAAAGA TGCTGATAAT AATATTGATA AAGATGTGCA ACATGATGTT GATCATTTAT 11640 

CCGATATGTC GGATAATAAT CACTTCGATA AATATGATTT AAAAGAAATG GATACTCAAA 11700 

45 TTGCCAAAGA TACTGATAGA AATGTGGATA AAGATGCCGA TAATAGCGTT GGTATGTCAT 11760 

CTAATGTCGA TACTGATAAA GACTCTAATA AAAATAAAGA CAAAGTCATA CAGCTGAATC 11820 

ATATTGCOGA TAAAAATAAT CATACTGGAA AAGCAGCAAA GCTTGACGTA GTGAAACAAA 11880 

ATTATAATAA TACAGACAAA GTTACTGACA AAAAAACAAC TGAACATCTG CCGAGTGATA 11940

TTCATAAAAC TGTAGATAAA ACAGTGAAAA CAAAAGAAAA AGCCGGCACA CCATCGAAAG 12000 

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CATGGTGGGG CTTATATOCG TTATTAGGTA TGTTAGCTTT ATTCATTCCT AAATTCAGAA 12120 
AAGAATCTAA ATAATTAnCT AAATATAGCA TATGTATGAT TAACTTTGTA GAC 12173 

(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1316 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311:

CAACATTAAT ATTGATATTA AATCTTCCTG GATAACGTGC TTGTTCGAGT GATAAGTATG 60 

CACGCACTTG ACTTAACTCT TTATCTAAAG TAATCGTATG TTGCTTAGAG CCTTGTAAAT 120 

TCGCTCTGAA AAAATAACTC AATTCTAATA GTAACTCTCG TGCCTTTTCG CTATTTATTC 180

TAACTAAAGC TGAGATCGTG TTAATTGAAT TGAAGAAAAA ATGTGGACTC ACTTGTGCCT 240

GTAATGACTT AATCTCAGCA TCTTTCAATA ACTTACTTTG CGTTTCGGCT TCACCAAGTT 300

CAATTTGGCT ACTAAAAATA TTTGCCAATC CTTCTGCAAG TTGACGTTCC ACAAAAGTTA 360

AATCATTAGG GTTTGTAAAA TACATCTTCA ATGTACCGAC GATAGAACCA TGCATCTCAA 420 

GTGGTATCAC GATAGCTGCT CTAAGCGGGC AATTCGGATG ACTACAACCA ATCTCTTCTT 480 

TAGTATGAAC TTCTTTCAAC TTTCCTGATT TCAATACATC TTTAGACAGA CTTGTTAATA 540

TTTCATTTGT TGGTATGTGA TGATCACTAC CTGCACCTAC ATGCGATAAG ATTTCATTTT 600 

TGCTTGTAAT TGCTACGGCA GATACTTTCA TTAAATTTTT AATAATCATC GCAATTTGCT 660 

GTGCCGATTC TCTATTCAAT CCTTCTTTAA AATACGGCAA TGTCTGGTTC ATCAATTGCA 720 

GTACATCATG TGTTTGAACA GCCTTCATTT GCTCCTCTTG CTTTAATGTT GAAATGATAA 780 

TAGACATAAA AATCGCCGTA CCAACGCTAT TAACAATAAT CATTGGTAGT GCAATTAATG 840

ATATGAGGTC AACCGCATAT GCTTTGTCGT GGGAAAATGT TAAAATGCTC AACATTTGAA 900 

TCATTTCCAT AACAATTCCA ATCATGGCAC TTTTCGCAAT ACTCGGGTAA CGCTTGCGTC 960 

45 TTTGAGCTTG TAAGCCAAAA TAACCAGCAA TTATACCAAT AAATATAGAT GAGATAAGAT 1020 

AAACTTGTGC ATCCGCCCCA CCCATATACA CTCTGAAAAT ACCTGAAATA ACGCCAACAA 1080 

ATAGACCTAC AAAAGGGCCA CCAACTAATC CTGCGACACC TATCGTTAAT ACACGTGTGT 1140

TAGCTAAAGA TACATCATCA TCTAAACGGA AGTACACACT TCCTGACAAA CTATGTTGAT 1200 

GATCGATGAC GATACCAGTT AAATTAGACA TTAAGGCAAA CAAACTGAAA ATAATACATA 1260 

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(2) INFORMATION FOR SEQ ID NO: 312:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7972 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312:

TATAAATATT ATTTTATTAT CGTTTATCGC TATTTGTGTT ACATAATCAA AACCATAAAT 60

TCTTACTCAT TCAGATTTAC CCAATATTTT TACTTTTATA ATGTAATGCG TTTTATCCAA 120

GTTATTTTTT AAAAATAAAT ATTGAATTnG GGGCTGnTTT CATGTCATTA AGAGATGAAG 180 

CATTGGAAAT GCACAAACGT AATCAAGGTA AATTAGAAGT TAAACCAAAT GTAAAAGTTA 240 

20 CTAATAAAGA GGAATTAAGT TTAGCATACT CACCTGGCGT TGCTGAACCG TGTAAAGATA 300 

TTTATGAAGA TAAAAGAAAA GTATATGATT ACACAATTAA AGGAAATACA GTTGCAGTTA 360 

TTACTGATGG AACAGCGGTA TTAGGTTTAG GTAACATTGG ACCTGAAGCA AGTATTCCTG 420 

TAATGGAAGG TAAAGCAGTA TTATTCAAAA GCTTCGCTGG TATCAATGGG GTGCCTATTG 480 

CGTTAAATAC AACTGATACC GAAGAAaTCA TTAAAACAGT TAAGTTGTTA GAACCTAATT 540

ATGGTGGTAT TAATTTAGAG GATATTTCGG CACCACGTtG TTTTGAAATT GAAGAACGAT 600

TGAAAAAAGA AACTAATATT CCGGTATTCC ATGACGATCA ACATGGTACA GCAATTGTAA 660 

CATTGGCAGG TTTGGTAAAT GCATTGAGAG TTGTTAACAA AGATATTGCT AAAATAAAAG 720 

35 TTGTACTAAA TGGTGCTGGT GCAGCAGGAA TAGCCATTGT TAAATTACTA TACGCGTATG 780 

GTGTAAGAAA TATGGTTATG TGTGACTCAA GAGGCGCAAT TTTTGAAGGA CGTTCATATG 840 

GTATGAATCC TACGAAAGAT GTTGTAGCAA AATGGACAAA TAAAGATAAG ATTGAAGGGT 900 

CTTTAGAAGA AGTCGTAAAA GACGCAGATG TATTTATCGG GGTTTCTGTA GCTAATGCGC 960

TGTCACAAGA TATGGTTAAG AGTATGGCAG ATAATCCAAT TATATTTGCA ATGGCTAATC 1020 

CAAATCCTGA AATAATACCT GATGATGCCA AAGCGGCAGG TGCACGAGTT GTTGGTACAG 1080 

GACGTTCAGA CTATCCTAAC CAAATTAATA ATGTATTAGC TTTCCCTGGT ATTTTTAGAG 1140

GTGCATTAGA GGTTGAAGCT ACACATATAA ATGAAGAAAT GAAAAAGGCA GCTGTAGAAG 1200 

CGATTGCTGA TTTAATCGAT AGTTCTGAAT TAAATGAAGA CTACTGTATC CCAGGACCGT 1260 

TTGATAAACG TGTAGCGCCA TCAGTTGCTC GTAATGTTGC TAAAGCGGCA ATGGAATCTG 1320 

GAGTAGCTAG GATTGAAGTT GATCCGCAAG ATGTGTATGA TAAAACAATG AAACTTACAG 1380 

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ATTAAAATGA TGAAAATGAA AGTTTATGAT AAACATTCAA CAGTCAAACG AATATAAATC 1500 

AAATAAATTT AAACCCGTTT TTAACTGGTC AAGTTCAGTT TAAGGCTCTA AATGGTTAGA 1560 

ACAGAGGTTA TTTGGAGGTT TTCCTATGTT TAAAGATTTT TTTAATCGAA CAAAGAAAAA 1620 

GAAATATCTT ACAGTACAAG ACTCTAAAAA TAATGATGTG CCTGCAGGTA TTATGACTAA 1680 

GTGTCCAAAG TGTAAGAAAA TTATGTACAC AAAAGAATTA GCTGAAAATT TAAATGTGTG 1740 

CTTTAATTGT GATCATCATA TTGCTTTAAC TGCGTATAAA CGTATAGAAG CAATTTCTGA 1800

TGAAGGATCA TTTACAGAAT TCGATAAGGG AATGACCTCT GCGAATCCAT TAGATTTTCC 1860

aAGTTATTTA GAAAAAATTG AAAAGGACCA ACAAAAGACA GGTCTTAAAG AAGCAGTTGT 1920

GACTGGTACA GCACAACTAG ATGGTATGAA ATTTGGCGTT GCTGTCATGG ATTCACGTTT 1980 

TAGAATGGGA AGTATGGGAT CGGTTATCGG TGAAAAGATA TGTCGCATCA TTGATTACTG 2040 

20 CACTGAGAAC CGTTTACCAT TTATTCTTTT CTCTGCAAGT GGTGGTGCAC GTATGCAAGA 2100 

AGGTATTATT TCCTTGATGC AAATGGGTAA AACCAGTGTA TCTTTAAAAC GTCATTCTGA 2160 

CGCTGGACTA TTATATATAT CATATTTAAC ACATCCAACT ACTGGTGGTG TATCTGCAAG 2220 

TTTTGCATCA GTTGGTGATA TAAATTTAAG TGAGCCAAAA GCGTTGATAG GTTTTGCAGG 2280

TCGTCGAGTT ATTGAACAGA CAATAAACGA AAAATTGCCA GATGATTTCC AAACTGCAGA 2340

ATTTTTATTA GAGCATGGAC AATTGGATAA AGTTGTACAT CGTAATGATA TGCGTCAAAC 2400

ATTGTCTGAA ATTCTAAAAA TCCATCAAGA GGTGACTAAA TAATGTTAGA TTTTGAAAAA 2460 

CCACTTTTTG AAATTCGAAA TAAAATTGAA TCTTTAAAAG AATCTCAAGA TAAAAATGAT 2520 

35 GTGGATTTAC AAGAAGAAAT TGACATGCTT GAAGCGTcAT TGGAACGAGA AACTAAAAAA 2580 

ATATATACAA ATCTAAAACC ATGGGATCGT GTGCAAATTG CGCGTTTGCA AGAAAGACCT 2640 

ACGACCCTAG ATTaTATTCC ATATATCTTT GATTCGTTTA TGGAACTACA TGGTGATCGT 2700

40 AATTTTAGAG ATGATCCAGC AATGATTGGT GGTATTGGCT TTTTAAATGG TCGTGCTGTT 2760 

ACAGTTaTTG GACAACAACG TGGAAAAGAT ACAAAAGATA ATATTTATCG AAATTTTGGT 2820 

ATGGCGCATC CAGAAGGTTA TCGAAAAGCA TTACGTTTAA TGAAACAAGC TGAAAAATTC 2880 

AATCGTCCTA TCTTTACATT TATAGATACA AAAGGTGCAT ATCCTGGTAA AGCTGCTGAA 2940 

GAACGTGGAC AAAGTGAATC TATCGCAACA AATTTGATTG AGATGGCTTC ATTAAAAGTA 3000 

CCAGTTATTG CGATTGTCAT TGGTGAAGGT GGCAGTGGAG GTGCTCTAGG TATTGGTATT 3060 

GCCAATAAAG TATTGATGTT AGAGAATAGT ACTTACTCTG TTATATCTCC TGAAGGTGCA 3120 

GCGGCATTAT TATGGAAAGA CAGTAATTTG GCTAAAATTG CAGCTGAAAC AATGAAAATT 3180 

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GGTGCACATA AAGATATTGA ACAGCAAGCT TTAGCTATTA AATCAGCGTT TGTTGCACAG 3300

TTAGATTCAC TTGAGTCATT ATCACGTGAT GAAATTGCTA ATGATCGCTT TGAAAAATTC 3360

AGAAATATCG GTTCTTATAT AGAATAATCA ACTTGAGCAT TTTTATGTTA AATCGATACT 3420

GGGTTTTACC ATAAATTGAA GTACATTAAA ACAATAATTT AATATTTAGA TACTGAATTT 3480

TAACTAAGAT TAGTAGTCAA AATTGTGGCT ACTAATCTTT TTTTAATTAA GTTAAAATAA 3540

AATTCAATAT TTAAAACGTT TACATCAATT CAATACATTA GTTTTGATGG AATGACATAT 3600

CAATTTGTGG TAATTTAGAG TTAAAGATAA ATCAGTTATA GAAAGGTATG TCGTCATGAA 3660

GAAAATTGCA GTTTTAACTA GTGGTGGAGA TTCACCTGGA ATGAATGCTG CCGTAAGAGC 3720

AGTTGTTCGT ACAGCAATTT ACAATGAAAT TGAAGTTTAT GGTGTGTATC ATGGTTACCA 3780

AGGATTGTTA AATGATGATA TTCATAAACT TGAATTAGGA TCAGTTGGGG ATACGATTCA 3840

GCGTGGAGGT ACATTCTTGT ATTCAGCAAG ATGTCCAGAG TTTAAGGAGC AAGAAGTACG 3900

TAAAGTTGCA ATCGAAAACT TACGTAAAAG AGGGATTGAG GGCCTTGTAG TTATTGGTGG 3960

TGACGGTAGT TATCGCGGTG CACAACGCAT CAGTGAGGAA TGTAAAGAAA TTCAAACTAT 4020

CGGTATTCCT GOTACGATTG ACAATGATAT CAATGGTACT GATTTTACAA TTGGATTTGA 4080

CACAGCATTA AATACGATTA TTGGCTTAGT CGACAAAATT AGAGATACTO CGTCAAGTCA 4140

CGCACGAACA TTTATCATTG AAGCAATGGG CCGTGATTGT GGAGATCTAG CATTATGGGC 4200

TGGATTATCA GTTGGTGCTG AGACAATTGT AGTTCCAGAA GTGAAAACAG ATATTAAAGA 4260

AATAGCTGAT AAAATTGAAC AAGGTATTAA ACGTGGTAAG AAACACTCAA TCGTTCTTGT 4320

AGCAGAAGGT TGTATGACTG CGCAAGATTG TCAAAAAGAA TTATCACAAT ACATCAATGT 4380

TGATAATAGA GTGTCTGTGT TAGGTCACGT TCAACGTGGT GGTAGCCCAA CAGGTGCGGA 4440

TAGASTTTTA GCATCACGTT TAGGTGGATA TGCGGTAGAC TTATTAATGC aAGGTGAAAC 4500

AGCTAAGGGT GTTGGAATTA AGAACAATAA AATTGTAGCA ACATCTTTTG ATGAAATTTT 4560

TGATGGTAAA GATCATAAAT TTGATTATAG TCTATATGAA CTTGCTAACA AGTTATCTAT 4620

ATAAGATTTC AGGAGGAATT ATAAAATGAG AAAAACTAAA ATTGTATGTA CAATTGGACC 4680

AGCTTCAGAA TCAGAAGAAA TGATTGAGAA ATTAATCAAT GCTGGTATGA ACGTTGCACG 4740

ATTAAACTTT TCACATGGTA GTCATGAAGA GCATAAAGGT AGAATTGATA CAATTCGTAA 4800

AGTAGCTAAA AGATTAGACA AAATTGtAGC AATTTTATTA GATACAAAAG GTCCAGAAAT 4860

TCGTACGCAT AATATGAAAG ACGGTATCAT TGAACTTGAA CGTGGCAACG AAGTTATTGT 4920

TAGCATGAAT GAAGTTGAAG GAACACCTGA AAAGTTCTCA GTAACATATG aAAACTTAAT 4980
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TAAAGATATT GACCATGCTA AAAAAGAAGT TAAATGTGAT ATTTTAAACT CTGGTGAGCT 5100 

TAAAAACAAA AAAGGTGTTA ACTTACCTGG CGTAAGAGTA AGTTTACCTG GTATTACAGA 5160 

AAAAGATGCT GAAGATATCC GTTTCGGTAT TAAAGAAAAT GTTGACTTCA TTGCAGCAAG 5220 

TTTCGTACGT CGTCCTAGTG ATGTTTTAGA AATTCGTGAA ATTTTAGAAG AACAAAAAGC 5280 

TAACATTTCA GTATTCCCTA AAATTGAAAA CCAAGAAGGT ATTGATAATA TTGCGGAAAT 5340 

TCTTGAAGTG TCTGATGGTT TAATGGTTGC ACGTGGTGAC ATGGGTGTTG AAATTCCACC 5400 

TGAAAAAGTA CCAATGGTTC AAAAAGATTT AATCAGACAA TGTAACAAAT TAGGTAAACC 5460 

AGTTATTACA GCTACACAAA TGTTAGATTC TATGCAACGT AACCCACQTG CTACACGTGC 5520

AGAAGCTAGT GACGTTGCCA ACGCAATCTA TGATGGTACA GATGCAGTAA TGTTATCTGG 5580 

TOAAACTGCT GCTGGTTTAT ATCCTGAAGA AGCTGTTAAA ACAATGAGAA ATATTGCTGT 5640

ATCAGCTGAA GCAGCCCAAG ATTACAAAAA GTTATTGTCA GATCGTACTA AATTAGTTGA 5700

AACTTCATTA GTGAATGCTA TCGGTATTTC GGTTGCACAT ACAGCTTTAA ACTTAAATGT 5760 

TAAAGCAATT GTAGCTGCTA CTGAAAGTGG TTCAACGGCA CGTACTATCT CCAAATATCG 5820 

TCCACATTCA GACATTATTG CGGTGACTCC AAGTGAAGAA ACTGCACGTC AATGTTCAAT 5880

TGTTTGGGGA GTTCAACCTG TAGTTAAAAA AGGACGTAAG AGTACAGATG CATTGTTAAA 5940 

CAATGCAGTT GCAACAGCTG TTGAAACTGG TAGAGTATCT AATGGTGATT TAATCATTAT 6000

TACTGCTGGT GTACCAACTG GTGAAACTGG AACTACTAAT ATGATGAAAA TCCACCTAGT 6060 

TGGTGACGAA ATTGCTAATG GTCAAGGTAT TGGACGTGGA TCAGTTGTTG GTACTACGTT 6120 

AGTTGCTGAA ACTGTTAAAG ATTTAGAAGG TAAAGATTTA TCTGACAAAG TTATOGTTAC 6180 


TAACTCAATC GATGAAACGT TTGTACCTTA TGTAGAAAAA GCTTTAGGCT TAATTACAGA 6240 

AGAAAATGGT ATTACATCAC CAAGTGCAAT TGTTGGTTTA GAAAAAGGTA TTCCAACAGT 6300 

TGTAGGTGTA GAAAAAGCTG TTAAAAACAT AAGCAATAAC ATGTTAGTTA CGATTGATGC 6360

TGCTCAAGGT AAAATCTTTG AAGGATATGC AAACGTACTA TAATTTATAA AAAAACGTCT 6420

TTCCATTTAT CAACAATGGA AAGGCGTTTT TTGGTTcATC TGGTATTTTA TGACGTAATT 6480

AATAGGTTAT TTGATAATGA TAGTGTATGA ATGGCAATCT ATATAAATGT TTATATCTTT 6540

TATACATGTA CATTATCACC TTCAAACTTT CACTCATATT ACTTTGGAAA TTTATTATAA 6600 

AATAGAAGTA TGGATGTATT TCTGAAATGA TACATTATTA AATAGATGAG AAAGTAAAAG 6660 


TTTTGAGCCA AGTACGCAAT TTAATATTAT AAGTTGCATA TAAAACAGGA TGGGACATAA 6720 

ATCCCTAAAA AAACAGCAGT AAGATAATTT TCAATTAGAA AATATCTTAC TGCTGTTCTC 6780 




tCTTCGACTG GCACTGCTCC CTCAGGAGTC TCGCCATTAA TACTAOGTAT TAACATGTAA 6900

TTTTACTTTT ACATACTTTA AAAAAATAAA ACACTTTGCC CAACTTGCAC ATAAATGTAA 6960

AATTCAATAA AATGAATTTT CTGTGTTGGG TCCCTTCGTA TAATTTAATA AATACCACTA 7020

AACTAAATTA ACGAGGTGCC TTATGTATAA AATTTATAAC x x nw%^> x /mAv^* 7080

AATAGAAACC TCTGTTAGAA TTCCTCAAAA TGATATTTCG Lunlnlul 1A nlUnnnl X VY X 7140

TGAAACGATA CCTGATAGCG AATTCGATGA ATTCAGACAT uA I LAj x\j\j\_\j (_AA(_A x A 7200

TCATCCAAAA ATGATGTTAA AAATCATCTT ATATGCATAT ACTCAATCTG TATTTTCTGG 7260

TCGTAGAATA GArAAATTAC TTCATGACAG TATTCGAATG ATGTGGx lAG CTCAAGATCA 7320

AACACCTTCT TATAAAACTA TTAATCGTTT TAGAGTGAAT CCTAATACTG ATGCGTTAAT 7380

TGAATCTTTA TTTATTCAGT TCCATAGTCA ATGTTTAAAG CAAAATCTTA TTGATAATAA 7440

TTCAATTTTT ATTGATGGTA CAAAAGTAGA AGCTAATGCC AATAGATATA CATTTGTGTG 7500

GAAGAAAAGT ATTCAAAATC ACGAATCGAA ATTGAACGAA AATTCAAAAA CATTATATCG 7560

TGACTTAGTT GAAGAAAAAA TAATACCAOA GATAAAAGAA GATGGAGATA GCGATTTAAC 7620

AATAGAAGAA ATAGATTTAA TTGGTAGTCA TTTAGATAAA GAAATCGAAG ATTTAAATCA 7680

TTCTATTGAG AACGAAOATT GTGCTCAAAT TAGAAAACAG ACCCGTAAAA AAATAACTGA 7740

GATTAAGAAG TTCAAAAAGA AATTTGATGA TTATTCCGAA AGAAAAAATA AATATGAAGA 7800

ACAAAAATCG ATTCTTAAAG ATAGAAATAG TTTTTCTAAA ACTGATCTGA TCATGATGCA 7860

ACTTTTATGA GAATGAAGGA AGACCATATG AAAAATGGCC AACTTAAGCC AGGATACAAT 7920

TTACAAATAG CGACAAATTC TCAAAAATGT TTTATCCTAT GACCTATTTC AA 7972



(2) INFORMATION FOR SEQ ID NO: 313: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3175 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 

ATTTTTTAGT TAATTGTCTT TCTTAAAATA ATTTTAGCTT TCATTAAATT AAACAATTTT 60 

ACAAGCTTGG AACACCAATC AAAATCCTAA GTTCTAAAAT GCAATATTAG TAGTCGTTGA 120


CTGAATGAAC ATATGCTTAT AATATTTTTT TGCAATGCTA GTCAAGTTGA TTTATGCTCA 180 

CAAGGATATG CGATTTATAT TTTCTTACAA CAATGAAAAT GCCTGATACA ATGCGATCCT 240 



AATTAAATTA TTGTTGTTTT AACCATTGTA CTATCGTCGG AATCATAAAT CCTGTTGGCC 360

CTTTTGGACC ATTATATGAA GCTTTATTAO TCGTTGCTGG ACCTGCAATA TCGAAATGAA 420

TGTGAGGTGT TTGACCACTA AAATGTGTTA CAAAACTTGC CGCAAATAGC GCTTTACCTT 480

GTCCATTCGT ATGGTTAACT AAATCAGCGA TATCACTQTG TTTAATACTT GCACGTTCGG 540

TTGCAGTAAT CGGTAATTCA AATACCATTT CATCGACTTC AGAACTTATT TGTAATATAT 600

CGTTTAATAT CACTTTACTA TTCGATTCAA ATGCAGCAGC TTTATCATCG CCTAGTGCAa 660

CAATTGCTGC ACCCGTTAAT GTAGCAAAGT CCATAATCAC ACTAGGCTGA TATTGATTTG 720 

CATAAAACAC AGCATCTGCA AGGACTAATC TACCTTCAGC GTCTGTATTC ATTACTTCTA 780

CAGTTTCACC ACTTAATGCT GTAAATACAT CATCTGGcTT CATTGATGCT TCATTTATCA 840

TATTTTCAGC ACACGCAAGC ACTCCGACAA TATTTACAGG CAGTTGTAAA CGAcTAGCCG 900 

CTTCAATGAT ACCAACGACA TTCGCAGCGC CACACATGTC AAACTTCATT GTAGCCATGC 960

CATTCTTCGT TTTAATACTA TAACCACCAG AATCATACGT TATACCTTTA CCAACTAAGG 1020 

CAATTGGTGC TTCATCTTTG TCTTTGCCAT TATATGTGAT GGTTACTAAt CTCGGTTTAT 1080 

GCTTACTACC TTTACCTACT GCTTGTAAAA GTCCGAATCC TTCAGAAACT AAAGTGTCAT 1140

AATCTTTAAC ATCTACTTTG ACCTTTGTAT TTTTAAAATG ATTAACAATA TCTTCTGcAA 1200 

ATGTTTGTGG TGtTAATACA TTCGGTGGcA TATTACTAAA GTCTCyTGcC AAATTAATGG 1260 


aTTGGcCAAT ACTGATACCC TCATGAATAA AATCTAATTC AATTAATGAT TCACTAATAA 1320 

GGkTTAAATT CGTCTTAAAC GGtGCCTTCT kACTTgrTTT ATAATGATCG aATyCATATG 1380 

kTGcACGCTC ACTTTGGAAT ACCGCATGCC ATTAATACAT CTGgATAACT GgATCcATAT 1440

TTTGgAnATA AATGaATCCa TAAGTAAATA CGTATCTTCa ATGTGTTCTG ACTTTATGTA 1500 

TTGGAAAAGA TGTCCCCaTA TTTTCAACAT ATCTTGATAA tACGTGTCTT TAAGTTTCCT 1560 

AAACGAACTG TAATTAAACG ATATGTTTGA TCTTGTACAT CAAATGCAGT TGTATAAATC 1620

TTCCCAACTT TACTACCAAT AATATGTTGA TGTTTTAGTC TTTCAAGTGA TTCTGTAATA 1680 

TCGATATGAT TAAAACTAAT GCGCTCTAAC TGATTTAAAT GTTCTGGTAT ACCAATAATC 1740 

AATGTATTTA TTTCGTTGCT TAGTGTGTTA TTTAATTTAA AATTCATAAT GTACCTCCTT 1800

AGATTTGATA TGTTACTCAC TTCAAATTGT ACAACAATAA AGCCCTcAGT GACACTGAAG 1860 

GACTTTATAA ATGAATAATT TAATTTTATG TGTTTAGCTG AAACTTTATT TAAAGCCTAG 1920 

AATTTACCTT TTTTGAATGC TAAACCGATA CCACCGATTT TGAATACCGC ACGTGTATCA 1980

ATCACTTTTT TCATGAATGC TGCTTTTTTA CCAGCGATAG GTTTACCAAA TACCATACCT 2040 


(a 1 IviAi 1 CAC CGTTTAAGAT GCGTTTAATG TTTTTAGCAA CACTTTCACC TTGTTGCATT 2160

1 1 1 vj 1 v» TAATGGACGT TCTTCTCCAG CTGGGATAAA CGCTGAACAG 2220

GTCGTAACGA TTGATTGTTA AATCTTGCTT TGTAACGATA 2280

1AACGCCTTC AAATGATTCT TCCATTAATT TACTACCACG TACACCAGCT 2340

GCC CATACLG AAGTACCTGC ATTTAATTGT TGTTTTTCAC CATCTACTTC AACTACAAAA 2400

C^T\ri~i~I"CGT TAGAAGCAAC GATTGGTGTA GCAATTTTAA ATTCAACACC GCGGTCTTCT 2460

AAGTAGCTAA CTGCGTGGTT AACTAATTCT TCTGAGAACA TTGGTAACAT TTTAGGTGCT 2520

GCTTCAACAC AAGTGATTTT AACTTTATTT TGATCCACAC CATATTTGCT ACATAATTCA 2580

GGAATTCTGT CTGTTAATTC ACCTAAGAAT TCAACACCAG TGAATCCAGC ACCACCAACT 2640

AAGATAGATA AATCGTTATC ATCTTTTTCT TTTGATGCTG CATAGTTAGC AAATTTGTCT 2700

TCGATATGAC GTGATAATTC ACGTGCTGTG ATAACATTTT CAATTTGGAA AGCATGATCT 2760

TTCATACCTT CGATGCCGAA TGTTTCACTA ACGAAACCTA ATGCTACTAC TAAAATATCA 2820

AAGTCATAAA TACCTTGATT TGTTTCTACC TTTTTAGCAT CACGGTCAAT TTTTGTTACT 2880

TCTGCtTGAA CAAAGTTCAC TTTGTCTTTC TTCAAGACAC TTTCCACAGG ATATAATACA 2940

TCTTCATAGT TTAGTGTACC TGCTGATGCT TCATGTAACC ATGTTGCTTC ATAGTGATAT 3000

TCATTTTTAT TAATAAGCGT AATTTCTGCT TCTTCTGTTG ATATCGCTTT TTGCAATTTA 3060

GTTACAGTTT GTAAACCTGC ATAACCAGCA CCAAGTACAA gtactttttt ACGATCTTGA 3120

GCCATTTAAT TnCACCTAAG CnTTCATATT TTTTTAACCA AATGCTGATA ATTAC 3175



(2) INFORMATION FOR SEQ ID NO: 314:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 702 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 
CATCATTATT AAAGATTTTC AATCAATACA gAATCACAAT ACGTACGCAT TGTGCACGAT 60

AAAAATACAG ATGTGTATAT TAACTATGAA CTACAAGAGC AACTAACGAA CAAAGCTTAC 120 
ATTGGTGATC ATATTTATGT TGAAGGGATA TGGCTCGAAG TACAAGCTGA TGGTTTAAAT 180 


GTATTGAGTC AGAATACAGT GGCATCGTCA TTAATTCGCT TAACACAAGA GATGCCACAT 240
GCACAGGCAG ATGATTACAA TACGTACCAT CGTTCGCCAA GGATTATTCA CCGTGAACCG 300 

TGGCGTTCCA TTATACCGCC ATTAGTAATG ATTGCTTTAA CTGTTGTCAT CTTTTTAGTG 420

AOACCAATTG GTATTTATAT TTTAATGATG ATTGGTATGA GTACAGTAAC GATAGTATTT 480 

GGTATTACAA CGTATTTCTC TGAAAAGAAA AAGTATAACA AAGATGTTGA AAAACGAGAG 540

AAAGATTACA AAGCTTATTT GGATAATAAA TCTAAAGAAA TTAATAAAGC GATTAAAGCA 600 

CAACGTTTTA GTTTGAATTA CCATTATCCA ACGGtTGCTG AAATTAAAGA TATCGTTGAA 660 


ACGAAAGCAC CAAGAATATA TGAnAAAACC ATCGGCATCA TC 702 

(2) INFORMATION FOR SEQ ID NO: 315:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4121 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315:





TATGTTCCGA CAACGAAACA AAGTGTAATT ACAAGAGCAA AGATAACTTT GAATGTTTGT 60

AAACGTCCAT CTTTACCTTC AGTTAAATGC ATGAACATTA ATAATTGArG TCCTGCTTGG 120

ACGAATGCAA AGCCAAAGAT AATTGTCAAC TTCGCGTGGA ATGTTAATGA CGTGTATAGT 180

GTTAdGTAAA CTGCTAAAAG CGTTAATACG ATAGATGCGA TAAATCCTAC AGTATGTTTC 240

ATTATTGTAC TCATCCGCTA TACACCATCC CTATCATATA TACGGCAGTA AAGATGAAAA 300

CCCAAACAAC ATCTAAGAAG TGCCAGTATA AACTTACTAT AAATAATTTT GGCGCATTAT 360

ATTTGTCTAA TCCGCGTCGT TGGATTTGGA TTAATAAACA AATGGCCCAA ACGATACCTA 420

GCGATACGTG ACAACCATGC GTTCCTAATA GGATAAAGAA ACTAGACCAG TAAGAACCAA 480

TTGTTGGGTT AACGCCTTCT GATGCATAGT GTGCGAATTC ATAAATTTCG AATCCAACAA 540

AGACTAAACC TAAAAGTAAC GTAATGATCA TCCAAAACAT CATTAACTTT TGTTTTTCTT 600

GGCGCATGTA GTAAATAGCA ATACCACATG TGTAAGAACT GAATAATAAT GCAAACGTCA 660

TTATTAAAAC AAGAGGCAAT TCAAATAACT CAGTAGTCAT TTTACCTGCA TAATCGCCAC 720

CATGTTGCAA AGTTAATAGT GTCGCAAATA GGGTACCGAA TAACGCAAAT TCGGCTGTAA 780

TGAAAATCCA AAAGCCAAGC TTATTTAATT CGCCTTCATG TGTGCGTGAA TCAATAGTGT 840

TTGTATCATG AOTCATGACT TACAGCCTCC CTTTCTTTAA TACGCGCTTC TCTTAATCTA 900

GCTTCAGTTT CTGCAACTTC AGCAGCAGGG ATATGATATC CGTGATCGAT TTGGAAACTG 960

CGATAAATCA TAGTACCAAA AATACCGAAT AAACAAATTA ATGCTGGAAT TACAGTTTCG 1020

TTGTTTGGCA TATGAATGTC TTTGTAATTA TGGTTGTCTA AGTAATGACG ACCATGTTCT 1140

TTCATATCAA CAAATGTGTC GTAGTCATTC CAATCTGGTG TAATGGCAAA GTTGTATTTA 1200 


GGTGGAATTG CTGATGCAGT AGTCCACTCT AGAGTACGAC CAAGGCCATC CCAGTTATCT 1260 

CCAGTTGCTT CACGTGGAGA TTTGAAGTGA CTGTATACGA TACTAACAAC AAGGAATAAG 1320 

AATCCGATTG CCATTAATAA TGCACCGATA GTTGAAATTA AGTTTAATAA GAACCAACCA 1380 


TCTGATGGCA TATAAGTGTA TAAACGACGT GGCATACCAT CTAATCCAAG AATGAATTGT 1440 

GGTAAGAAAC AAACGTTAAA TCCGATCATG AAGAACCAGA AGCACCATTT GTTTAATGTT 1500 

TCGTTTAATT TGTAACCCAT CATCTTTGGA TACCAGAAGA TTAAACCAGC TAAGCAGGCA 1560

AATACAACAC CAGTAACCAA TGTATAGTGG AAGTGAGCTA CTAAGAAGTA CGTATTGTGA 1620 

TATTGATAGT CAGCTGATGC CATTGCTAAC ATTACACCCG TAACACCACC TAAAAGGAAG 1680 

TTAGGGATAA ATGCTAATGA GAATAGCATT GGTGACTCAA ATGTAATACG TCCTTTATAT 1740

AATGTTAATA ACCAGTTAAA CAATTTCACA CCAGTTGGAA TACCAATCAG CATTGTTGAA 1800 

ATTGAGAAGA ATGAGTTGAT TAACGCACCA TTACCCATTG TGAAGAAATG GTGAACCCAA 1860 

ACTAAGAAAC TAAGGAACGC GATACCGGCA GTTGCCCATA CCATACTTTG ATGTCCGAAT 1920

AAACGCTTAC GAGOGAATGT CGGGATAATT TCTGAGTAAA TACCAAATGC TGGAAGGATA 1980

ACGATATAAA CTTCAGGGTG CCCCCATACC CAGAAGAAGT TAGCCCAAAG CATTGGCATA 2040

CCGCCATGTG CAACTGTGAA GAATGCTGTG TCAAATATTC TATCAGTTGT CATTAATGCT 2100 

AACGCTACTG TTAAAGGAGG GAAAGCAAGA ATAACAATTA ATGTAGTAAT AAATGTTGTT 2160 

ACTGTAAACA TTGGCATTTG CATAAACTTC ATAGTTGGTG TTTTACATCT TAAAATTGTT 2220 


ACAAAGAAGT TGATACCTGT AGCTAAGGTA CCAAGCCCTG AAATTTGTAT AGCTATTAAG 2280 

TAATAGTTAA CACCCGGACC AGGACTGAAT TCACCTGCTA GTGGCGCATA GTTTGTCCAA 2340 

CCAGCTGCTG GTGAACCACC AATAATAAAT GACAGGTTGA ATAAAATCAT ACCTGCAAAG 2400

AATAGCCAGA AACTTACGTT GTTTAATACT GGGAATGCAA CATCACGTGC TCCAATTTGT 2460 

AATGGAACAA CGATATTCCA TAAACCAAAG ATAAATGGCA TTGCCATGAA GATAATCATG 2520 

ATTACACCAT GTGTACTAAA AATTTCGTTA TAGTGGTTAG ATTCTAAAAA TTTGTTATCA 2580

GGTACTGTTA ATTGCGCACG AATAAGTAAC GCATCAATAC CACCACGGAC GAACATTAAT 2640 

ACGGCACAGA TTAAATACAT AATACCGATT TTCTTATGGT CTACAGATGT GAACCATTCT 2700 


TTGTAAAGAT ATTTCCATAA TTTAAAGTAA GTAATTACTG CGATTAAACC AATAACTAAG 2760 

AATGGGGCAC CAATTTGTGC CATTGTAATC ATcCAGTTAC CTTTAACTAG TAATTGATCC 2820 

TTGAAATTTT CTTCATTTCT TTCGCATTTT TCGATTCATC TTTCTTGAAC TCATTGTTAT 2940

ATGGTTCGTC ATTTCCAAGA ATCATCAACT TCATACCATG TCGTTTATAG TTCGCATTTG 3000 

TAATTTGAGC TTTACGAGCA GGTATTAATG GTTTGTCTGA TACATCTTTA AACATATTTT 3060 

CTTCACTAGT GAAGTTTGGA TCTTTCAATT CGAAATTGAA ACGTTTATAT GCATAGAAGA 3120 

TGTATTCTGG ATCGGCTGCT GGATCAACAA ACGCCATATG TGTACCATTA AATTCTAAAG 3180 

CTTTATTAGG TGTGCTTGGT AATAATTGTT TATCAAATGT ATCTTGATCT AACGTTTTCT 3240 

TACCTTTAAC TTCTTTCACC CATTTGTCGT AGTCTTTTTG ACTAACGGCA TTTACTTTAA 3300 

ATGTTTGACG TGAGAATCCT TCACCATTGA AGTTAGAGTT ACGACCTCTG AACGTACCAG 3360 

TTTGAGATGC TTCTAAOGTC CAATTCATTG TCATGCCAGT CATGGCATAT TTTTGACCAC 3420 

CTAATTGTOG AATCCAGAAA CTTGTCATTG TATCCATAGC TTGAAGCTTA AATACAACAG 3480

GAOGATCTTT AGGGATTGTT AATGTATTAA CAGTCTCTAT ATGTTCATCT GGATAAGCAA 3540

AGAACCATTT GTATCCTGCA CTTACTGCAT ATACAACCAT TGGATCTTTC TCACTCTTCG 3600 

GTGGTTTTTC GTAATCGTAT AAAGTTTTAA CTGTAGGAAT AGcfcAAAGCA GCAACGATTA 3660 

TGATAGGTAT TACAAACCAT ATTGTTTCAA TGATGGCATT ATGGTGCATC TTACCAGATT 3720 

CGGCATTCTT ATTATAACTA TACTTGTAAA TAAAAATGGC GAACATGCCA AGTACAACGA 3780 

AACAAATAAC AAGCATGAAG ACGATTGAAT AAAGAATCAA GAACTTCTGA CTACTTGCTA 3840

CTGGCCCTTT TGCGTTGAAA ATTTCTATAT TTGAACAACC ACTAAGTAAA ATTAGTGTGG 3900

CAAATAATAG AAGCAAAGAC TTAAATTTTG ACACTTTTTT GACCTCCTAA TACTACAAAT 3960

GTAGGGCTTA ACATTAATTT TAAGTTATTA CACAATATTT ACAAGGGCTT ATGGGAAAAA 4020

AATTAATAAA ATTGTATCAA AAATGTTGAT AAATCAAGGT GTGACGTGGG TTCACACATT 4080

TGTTAAAATT ATGTGTACAT TTTGTGACTA ATAGCGTTTT T 4121 
(2) INFORMATION FOR SEQ ID NO: 316: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9310 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: 
CGAGTGAGTA CAmACATATT TTTATTTGCA AGGGGTAAAT GGCATATAAC TATCTTTTTT 60 
ATGTAAGCTG GTATAAAATT TATACTAATA GGAGGGATAG TATGAATATA GTAGGGCATC 120
TTGGATTACG ATTAGTTGAA AAGTCGGTTA ATCAAGACAA TCCTTCAATG TATCATTTGT 240

TTTATGGGGA CGAAgTAGGT ACAGCCGGAA CAATTTTAAG CTTTTTTGAA ATTCCCAATG 300 

CGGGTCATAA GCAGCCAGGT ACTGAAACGA TTTATCOATT TTCATTATTA GTACCAAATC 360 

AAGCGGCACT TCATTATTTT GAAAAACGTC TTGAGAATAA TGGTATTAAG TCTGAACGTT 420 

TGTACTATCT TGGACAAGAA GGTGTTGTCT TTAAAGATGA AGACGACTTA GAAATCATAT 480

TGCTTGTTAA TGATAGTTTT GAAGTACCAC ATCAATGGCA ACATAACGCT TATAGTGAAA 540 

TACCTCAAGC ATATCAAATT TTAGGAATAG GGCCAGTCGA ATTAAGAGTT AGAAATGCAG 600 

CGOGTACGGT AGAATTTTTG GAAAATGTCT TAGGTTATCG CAAAAGAGAT AATAAATCAT 660 

TCGATGTGCT GACATTAGCA CCACAAGGTT TATATTCGGA TTTTGTAGTT ATTGAGCAAC 720 

AGGGACAACG TGAAAGACCT GGACGAGGTT ATATCCATCA TATTGCAGTT AATACACCAC 780 

AAATGAGTGA CTTAGATGCA ATTTACAAGA AATTACAACA ACAACCACAA AGTAATTCAG 840 

GTATAATTGA TCGCTATTTC TTTAAATCAT TATACTATCG CCATAATTCA ATTATGTATG 900 

AATTTGCGAC TGAAGCGCCT GGATTTACTA TTGATACACC TGTTGAACAA TTAGGAAGTC 960 

AATTGAACTT GCCTGACTTT TTAGAAGCAG AACGTGAACA AATTGAAAGT AAGTTACACG 1020

AAATATAAAG GAGAATGTTT AATGGCCAAA TTAGAAATGA ATAAAAATAC GCCTCTTGAG 1080 

TTTGGTTTGT ATTCCTTAGG TGATCATTTA TTGAATCCAT TGAAAGGTGA AAAAGTTAGT 1140

TATGAGCAAC GTATTAATGA AATTATTGAA GCAAGTAAAT TAGCAGATGA AGCAGGTATT 1200

GATGTTTTTG CAGTTGGTGA AAGTCATCAG GAGCATTTTA CAACACAGGC ACATACGGTT 1260 

GTGTTAGGTG CAATTGCCCA AGCGACAAAG CATATTAAAG TTTCAAGTTC TTCAACX5ATT 1320 

ATTAGTGCAC AGATCCTGTA AGAGTATTTG AAGACTTCGC GAGATTAGAT TTGATTTCTC 1380

ATGGTAGAGC CGAAATTGTA GCTGGCAGAG CATCAAGAAC AGGTATTTTT GACTTGTTTG 1440

GCTATGATTT AAAAGACTAT GATGAATTGT TTGAAGAAAA ATTAGGTTTA CTTTTAGAGT 1500 

TAAATAAAAC TGAGCGTATT ACTTGGTCTG GAAAATATCG TCCAGAACTT AGAAATATGA 1560 

AAATATTCCC AAGACCAATC GATAATATAT TGCCAATATG GCGTGCTGTT GGTGGTCCAC 1620 

CTGCAAGTGC TATTAAAGCG GGAAAACAAG GTGTGCCAAT GATGATTACA ACCCTTGGTG 1680 

GCCCAGCAAT GAACTTTAAA GGTTCTATAG ATGCTTATCG TCAAGCGGCA ACTGAAGCAG 1740 

GTTTCGATgc TTCGCCTAAG TCTTTACCAG TAAGTACAGC GAGTCTGTTT TATACAGCTG 1800 

AAACAACTCA GGATGCTATG AGAGAATTTT ATCCACATTT GAATACAGGG ATGTCATTTA 1860

TTCGTGGTGT TGGTTATCCG AAACAGCAAT TTGCTAATTC GTCAGATTAT CGAGAAGCGC 1920
GTCATCAACG TTTTATOOCA CAGCTTOATT TTGGCGGTQT GCCATTTGAA AATGTTATGA 2040

AGAATATTGA GTTAATTGGC AACGACATTA TACCGGCGAT TAAAAAGCAT TTATCAAAAT 2100 

AGGAGGGGCG TCATCATGAA TATTGTATTA TTGTCAGGTT CCACAGTAGG TTCTAAAACG 2160 

AGAATTGCTA TGGATGATTT AAAAAATGAA CTAGAAGTCA TCAATGAGOG ACATCAAATA 2220 

GAGTTGATGG ATTTACGAGA ACTTGAATTA GAATTTAGCG TTGGAAAGAA TTATCTAGAT 2280 

ACTACAGGAG ATGTATATAA ATTAACGACG TCGTTAATGC AGGCTGATGT GATTTTTATT 2340

GGTTTTCCAA TTTTTCAAGC TTCCATCCCT GGTGCTTTGA AAAATGTGTT TGATCTACTT 2400 

CCAGTCAATG CGTTTCGTGA CAAGGTAATA GGACTTGTAG CGACAGCAGG TTCTAGTAAA 2460 

CATTATTTAA TTCCTOAAAT GCATTTAAAA CCAATATTGA GTTACATGAA AGCACATACG 2520 

ATGCAAACGT ATGTATTTAT TGAAGAGAAA GATTTTTCAA ATCAACAAAT TGTCAATGAT 2580 

GATGTTGTAT TTCGGTTAAA AGCGTTGGCA CAATCCACAA TGCGAACTGC CAAAGTACAA 2640

CAACAAGTGT TTGAAOAAGA AAACAACCAA TACGACTTTT AAAGTATAAA AATAAGACGC 2700 

TOGGCACACT AAATTTGTAA GTGTTTGAGC GTCTTTTCAT ATTAACTATA TAQCCAATGA 2760 

ACGACGATAA AGGCAAGTGA TGACAAGCAT ATTGAGGTAA TAATGATTGT CATAAGCGGT 2820

TTAAGTGCGC GATTTTTAAG ATCTTTAAAT GCAACATTTA ACCCTAAAGC AACCATGGCC 2880 

ATTAATAAGC AAATTGTTGA TACAGTATTT AAAATATTTA GCAATGCTGA CGGAATAGTT 2940

ACATATGTAT TCACTAAGGC CATAATGACA AATCCAATTA AAAAGTATGG AATGCTTATT 3000

CGACCCTTGC TAGATGATTC TGATGAACGG AAACGCATAA TTAAAATAAG TACGATGGTT 3060 

AATGGAATCA GTAAGAATAC TCTACCAAGT TTACCAAGAA GTGCAATTTT AAGTGCATCA 3120 

CTACCACGAA AGCCACCAGC TAAGACAACG TGTGCAATTT CATGAAGACT AACACCAGAC 3180 

CAAGCGCCAT AAACATTTGT CGTCATTGAA AAGATAGCGT AGATAGCTGT ATATATAAGT 3240 

GAAAATATCG TACCAATCAA TGCGATGATA CCGATACTAA TAGCTGTATC CTTTTCACGT 3300

GATTTGAATA TTGGAGCGAC TGCGGCAATA GCAGCAGCAC CACAAACGCC TGTGCCGACA 3360 

CCTAGTAATA ATGCGATGTT TTTGTCACCA TGCAACAGTT TGTTGACAAA GAGCATCATT 3420 

ACAATACTGA AAATAACGAC ACCTACATCG ATGGCTAATA GTTTACTACC TTGACCGATA 3480 

ATATCGAATA TATTGAGTTT AAGTCCATAT AGGATGATTG CAAATCTTAA TAAATATTTA 3540 

GATGAAAACG TAATACCTGA GCTATATTGT TCAGGATATC CTCTAAAGTG ACGATATAGA 3600 

ATAGCGATTA ATATCGCGAT AGTTAATGCG CCAACCTTAT CTAGGATTGG CAATTTAGCT 3660

GCTAAAAAGC TAAATAATGC GACTATAAAT GTTAGTGATA GCCCAATCAT AAAATGCTTA 3720
ATTTTTAAAT ATAAATTTGG AATGAATAAT AAAGTAGTGA TTAAATTAAG TTQTGTOATA 3840

GGAAACTTGG ACATCAATCA AAGTAATAGG CACTACAACG CTTATTGGCG GGGCCCCAAC 3900 

AAAGAAGCTG ACGAAAAGTC agCTTGcAAT AATGTGCAAG TTGGGGATGG GCCCCAACAT 3960

AGAGAAATTG GGTCCGTAAT TTCTACAGAC AATGCAAGTT OGCGGQGCCC CAACATAGAG 4020 

AATTTCGAAA AGAAATTCTA CAAGCAATGC AAGTTGGGGA AGGACAACAA ATTTAAGATA 4080

CAATGCGTAA CATTAATATG TTATTATAAT GATAATTTAC AGAATTATAT GAAAAATGAA 4140

TGAGGATGTG ATGGTATGTT TGGAATGAAA GTGAATGAAC AAATAACATT AAAAATTTTA 4200 

GAAGCTCATG ACACAGAAGC GCTTTTCAAT TTAGTCAATC GTTCAAGAAA TTCACTTAGG 4260 


GAATGGTTAC CTTGGGTAGA TGCAACTGAG CAACCATCAG ATAOGOGTGC ATTTATTAAA 4320 

AGAGGACTTT TGCAATTTGC TGATGGTAAT GGATTTCAGT GTGGCATTTG GTATGAAGGA 4380 

ACGCTAGTTG GTGTCATCGG TTTACATGAA ATTAATCACA TGCACAGAAA AACTTCATTA 4440 


GGGTACTATT TAGATAAAGA ATTTGAGGGT CATGGGATTA TGACACAAGC AGTTGAGGCA 4500 

TTGATAAAGT ATTGTTTCGA AGAGCTTGAC TTAAACCGAA TTGAGATTAG TGCCGCAGTT 4560

AATAATGAAA AAAGCOGGGC TATTCCTGAA AGGCTGGGAT TTACTAGAGA AGGTATGTTA 4620

CGTGACAATG AATTACTAAA TGGTATTTAT TCATGGAGTT ACATCTATAG TTTATTAAAA 4680

TCAGAATACG ACCAAAAATG ACAAATTAGA CTTACAAAAG AGTGATGACA TTTAAAATGG 4740

CAGCGCTCTT TTATTTAATT TTTGAAAATA AAAGGTTGTT GACAGTATTA TTTTATAACA 4800

ATATAATGAT TTTGATAATT ATTATCAACT AGATGATGTT TATGGGAGGA TGCTTTAAAA 4860

CAGCCGTTTT AAGTGTAATG TATTATTTTA GCGTGTAGGG AATGCGAAAA TAATATTTAT 4920

AAGAACACAT CTATGGGGAT AATAGAATTT CTATAATGAG GTGTCAAAAT GAAAAAGTTA 4980

ACAACGCTAT TATTAGCATC AACGTTATTA ATTGCTGCAT GTGGGAACGA CGATAGTAAG 5040 

AAGGATGATT CAAAGACATC GAAAAAAGAT GATGGTGTTA AAGCAGAATT AAAACAAGCA 5100 


ACAAAAGCAT ATGATAAATA TACTGATGAA CAGTTAAATG AATTTTTAAA AGGTACAGAA 5160 

AAATTTGTTA AAGCGATTGA AAATAATGAT ATGGCCCAAG CAAAAGCGTT ATATCCAAAA 5220 

GTTCGTATGT ATTATGAACG CTCTGAACCA GTTGCAGAAG CATTTGGAGA TTTAGATCCT 5280 


AAAATTGATG CACGTCTTGC AGATATGAAA GAAGAGAAAA AGGAAAAAGA ATGGTCAGGA 5340 

TATCATAAGA TTGAAAAAGC ATTATACGAA GATAAGAAAA TTGATGATGT GACTAAAAAA 5400 

GATGCACAAC AATTATTGAA AGATGCAAAA GAATTGCATG CCAAAGCTGA TACATTAGAT 5460

ATCACACCAA AATTAATGTT ACAAGGTTCT GTTGACCTAT TAAATGAAGT TGCAACTTCT 5520 




GTTGAAGGCG CACAAAAAAT TTATGACTTA
AAATTAAGTG ATGATATCCA AATGAACTTC

AAAGATAACA ACGGCGGTTA TGAGTCATTT

TTTGCGGATG CTGTTAATGC ATTAGGAGAG
TGACAAATTA TGAACAAGTT AACGATAGTA

TGTTAGGTAT TGGCGGTGCC GGTGTTGCAA

CTTTCAAATC AATGTTCAAT ACACCAGAAG
GTAAAGTGCA ACCAGGCATT ACCACACCCA

ATTTGAAGTC AAAAGATAGA GATGCAATTA
CTGATCGTAT GATGGATGGT GATACAGTTG
CAGTAGATAC OGGTGAATCG ATAGGATTAG

TTAGTAAGTC TTTGATGAAG AAAATTGGGT
ATTTACCGCA TTTTCCGAAT GATCAGTTAA

TTCAAGCATG CTCAAATGAT TCGCAAGTAT

CATTTCGAGA TATTGTTAAG GTACGTTGGG
AGGAAACACC TAGAAATTTA ATGGCATTTA

ATCAACTTAA AGATTATGTG TTTATTGATG

TTGTCAGACG TATTCAAATA CACATTGAAA
AGGCTACATT TGGTCGGAAA CGACATAGTG

ATGAAATTGA CTTAAAAGCG AAAGATAGTC

ATACGAGGCT AGCGAAAGAA GCAAATACGT
ATGGTACGGA TGACCGCACA GGTAACTTCG

AAGCGACAAA ACAATTTATC GATATACAAA
AATATATTAC ACATAGAGGT TCTGCTTCAT
GATACCTTGG TGAAACATTA TTTGACTAAA

GTGTGTAGCT TTGGGTTACT GAAAAGTCAG
TATAGTGTGA TAACGGATGC GAAATCTGCA
AAGCAGAAAG CAATTGAGCA AGTGGTAAGT

AGTGAAAGTA ATGCTGTCAA ATCAGATGTG

TTTAAACCTA TTTTAGAGAA AAAAGATAAA 5640

GATAAAGTGA ATCAATTATT GGATAAATAT 5700 

GAAAAAGTAT CGAAGAAAGA CCGTAAAGCA 5760 

CCACTAAGTA AAATGGCTGT GATTACTGAA 5820 

CGCAATTTTC AAGACGTACA TTTTTGAAAA 5880 

TTGGCGCAAG TGGTGTTGGT AGCATGTGGT 5940 

ATCCGGAAAA AGATGCGTAT GAATTTTATG 6000 

CGCAAAAAAC ATGCAATTTC GTTGCGTTAG 6060 

AGGCAATGTT TAAAAAGTGG ACGGTTATGG 6120 

GCAAGCCGAG TAACAATCCT TTAATGCCAC 6180 

GTGCAAGCAA GTTAACGATT ACCTTTGGGA 6240 

TATCTAGTAA AATTCCCGAT GCCTTTAAAG 6300 

TAGACGATTA CAGCGATGGT GATATTATGA 6360 

CCTTTCATGC GGTTCATAAT TTAGTTCGTC 6420 

CGCAATCTGG TTTTATCTCT GCTAAAGGTA 6480 

AAGATGGAAC AATTAATCCT AGGAAGAATA 6540 

ACGGATGGGC GAAACATGGA ACTTATTGCG 6600 

CGTGGGATCG TACTGGGGTG GAAGAACAAG 6660 

GTGCACCGTT AACAGGTGGG AAAGAGTTTG 6720 

ATGGCGAGTA TATTATTGAT AAAGATGCCC 6780 

CAATTTTACG TAGAGCCTTT AATTATGTGG 6840 

AAACAGGCTT ACTTTTTATT GCTTTTCAAA 6900 

ATAATTTAGG TAGTAATGAT AAATTAAATG 6960 

TTTTAGTATT ACCAGGTGTT AGTAAGGGAG 7020 

TTTGTAGCAA TGCTAATAAC TGCTGCTATG 7080 

GCAGCAGAAC AACAAAGTAT TAGTGATGTA 7140 

CTTTCTAATA ATTCGATATC GAATGACAAT 7200 

GCAGTTAAGA AATTATCGCT TGAAGATAAT 7260

AGAAAGCTTG AAGATGCAAA AGCGAATGAT 7320
GCTAGTAAAG ATGCGGGTTC TAAAATTAAA CTATTGCAAC AGCAAGTCGA TGCTAAAGAT 7440

GCTGCGATGA CAAAAGCGAT TAAAGATAAA AATAAAGCGG AATTAGAATC TTTGAACAAT 7500 

AGTTTGAATC AGATTTGGAC AAGTAATGAA ACAGTGATTC GCAATTATGA CGCAAATCAA 7560

TATGGACAAA TTGAAGTCGC ATTATTACAA CTTAGAATTG CAATTCATAA GTCACCATTA 7620 

GATACGGCAA AAGTGTCACA TGCTTGGACA ACTTTTAAAT CAAATATTGA TCATGTCGAT 7680 

AAAAAAAGTA ATACGTCTGC AAATGATCAA TACCATGTAT CACAATTAAA TGATGOGTTA 7740

GAGAAGGCGA TTAAAGCTAT CGACGACAAT CAATTGTCGG ATGCTGcaTg TGCGCTTACA 7800 

CATTTTATAG AAACTTGGCC GTATGTTGAA GGTCAAATTC AAACTAAAGA CGGTGCTTTG 7860 


TATACGAAAA TTGAAGATAA AATACCATAT TATCAAAGTG TATTAGACGA ACATAATAAA 7920 

GCACATGTGA AAGATGGTTT AGTAGATTTA AATAACCAAA TTAAAGAGGT TGTTGGCCAT 7980 

AGTTATAGCT TCGTCGATGT GATGATTATC TTTTTACGTG AAGGGCTAGA AGTGTTGTTA 8040 


ATTGTAATGA CATTGACTAC CATGACGCGT AATGTAAAAG ATAAGAAAGG GACTGCAAGT 8100 

GTGATTGGTG GTGCAATTGC CGGACTTGTA CTGAGTATTA TCTTAGCAAT TACGTTTGTA 8160

GAAACTTTAG GGAATAGTGG CATTCTTCGT GAAAGTATGG AAGCGGGATT AGGTATCGTT 8220

GCGGTCATAT TAATGTTTAT CGTTGGTGTT TGGATGCACA AACGTTCAAA TGCAAAACGT 8280 

TGGAATGACA TGATTAAAAA TATGTATGCT AATGCGATTA GTAATGGTAA TTTGGTATTG 8340 

TTAGCGACGA TTGGTTTAAT ATCTGTGTTG CGTGAAGGTG TCGAGGTTAT CATTTTCTAT 8400

ATGGGGATGA TAGGTGAGCT AGCGACCAAA GATTTTATTA TTGGTATTGC TTTAGCTATC 8460

GTTATTTTAA TCATCTTTGC ATTATTATTT AGATTTATAG TTAAATTAAT ACCTATTTTC 8520

TATATATTTA GAGTGTTGTC GATCTTTATT TTTATTATGG GATTCAAAAT GCTTGGCGTA 8580

AGTATTCAAA AGTTACAATT ATTAGGTGCG ATGCCAAGAC ATGTTATTGA AGGATTCCCA 8640

ACGATTAACT GGTTGGGCTT TTATCCAAGT TATGAACCAT TGATAGCACA AGGTGCTTAT 8700 


ATTATGGTAG TTGCTATCTT AATCTTTAAA TTTAAAAAAT AAAAAACAGG CCGAGTGCCT 8760 

GTTTTTTTTG TTGCTATATT GGAAATATTC GGTATTGCAG TATAACGATA ATCACAGCAT 8820 

TGATTCGTAT AAGGTTAATG TGTTGGCGGT TTGCCTCGGC ATGTGAACTT AACGATGAAC 8880 


ATACTGAACT CAAAGAGCAA TATGAGTGGC AATGTGAGTA ATATATTTAA TGTTAAATCG 8940 

GGTGGTGCAA TGATACTXGC TAATACAAAG CAAGCGAAAT AAATATATTT ACGrTAATGT 9000 

TTCAATGATG TGGTATCTAT AAGACCGAAT TTTGCAAGAC CCATAAATAA TATTGGTAAT 9060

TGAAATAGAA GACCAAATGT GAATAACCAA CGTATGAGTT CAATCAAATA TGCTTTAAAG 9120 

GGAAAGCCAA CATAAAATOC AAAAGCGACG CCAGCACAGA ATAATAACAC GCTOAAAAAA 9240

CTATATTTAT AAATAAATTG ACGTTCATTA TTATOTAATC CAGGTGCAAT GAATGCCCAC 9300 

AATTGATAAA 9310



(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3458 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 
ATTAGTATTA CACACTAATG TAAATAGATT GGTTGGAGAA GAGATATTTG CTAATAAGTG 60 

CCTTGCTAAT AATGATGTTC AAATTATGAA TTCAATAAAA AAATTAATTG AAGCTGAGTT 120

ATTAACAACG ACTAATGATT TTGAAGTTAG TATATATAAA AAGACAAGAC CTGAATTACA 180 

AAGTATTTTA AAAAGTTTTG GTATAAAAAC AACAGGTAAT AAACCTGACT TAATTAAACG 240 

TATTGACGAC AATTTTCATA TTATTAATAA CTTAGATTTA CCATATGTAT ATATACCAAT 300

TAAGAAAGGA GAAGAAATTT TAAAGAAAAC AGAGTACTTA ACCTCTTTTA TACAAAGTTA 360 

TGGTGAAATT TCTCTTGAGC GTGCTTATTA TTTGGTTGAA AACTATATAG ATGAAAATTG 420 

CGATGATAAA GTTGCAGAAA TATACAAGTT TGAATTTCAA AGAAAATATG ACAATGGCGA 480

GTTTGATTTT AATCATGGAT ATAATTTCGA ATTGAATATG TTGATAGATC ACTATAAAAG 540 

AGATGTAAAA GACTACGATA ATGCCAGAAA GTATTCAAAT ATTTATCTTT ACTTTGGTTT 600 

GAGAGATTTC TTAAAAAAAT TAATGAGTAA TTATTCATAT TACGATAgTA AAGGGAATAT 660

AGATTTGAAC GAAATACAAA ACGATCTGAA TAGATTTATC AACTCTAGCG CTTCTGGTAT 720 

GTACGAGCGA TTAATATATA ATGAAAATTT ATCCAATAAT ATTATGTTTG AATTATTTAA 780 


AAAGGACACA CAAGATTATA GTGATTTGGA AGAACAATTG ATTGAAAAGT TCATAAACTA 840 

TGTAGTGTCT AATGTAAAAA AAGAAAGTAG GAGTAATACT CTTATAGAGT TATCAAAAAT 900 

TTTAGAGAAC GGATATACAA TTGATAAAGA AGAATTTAAA AAAGAAGATG ATTATCTTTC 960 


TAAGTACATA TTTACTGACA TAGATTATTT GAAAAAGTTA GAATCAAAAA TAAACGTTGC 1020 

TATTGATATT CGAAGTGGAG AAATTCATTT GGTATTAGAT GATGATAGCC TTGATATATT 1080 

AATACAAAAT CAAAAATACG GCAATGAGTT TTGAGTCATA ACTAAATATA ATATGTTGAA 1140

GAGAGGTTTT ATTTAAATGG CTAAAATTGG TTATGATCGT GTATCAACGA AAGATTAATA 1200 

AACGTACAGA GCTTGTTAAG TGTTTAGATT
ATCAACTTGA TCGOTTAGGT AGAACGACAA
ATGATAACGG AATTGACTTA CAAATTAGTA

GCAAAATGTT TTTTACGATG ATGAGTGCAT
GAGTGTATAA AATAGACTTA GCAGCAACAA

CTTTACCAGA GAATAAAAAA TGAGAAATTA
GGGAAGAAAT AGCTAGTTAG ACAAGGGTAT
AAATGAAAAA ACTTATACTA TGAATTACTG

GAGCAAGTTG GATAGATGGT GGCTAATCTC
TTGTTACTCC TAGnAAAGGA CTAGCATGTC

AATCATTGGT TTAGTATTGA TTAGTACTCA

TGACCGGCAT TTTTGAAGGC TATTTTTAAA
GCTCATTAGG GTAACATGTT TCCGAGTGTT

TTCATCTTTT GCTTCTACTT GCTACAAATA

GTTCTGAACA TAATCAGAAT TAATAAAACC
AAGTATAGAA TGTAATCACG AATTATATAA

AATTCACTTA GTTATTTTAG AAAGAAGCCC

ACAGTTgGAT TACGCATGAA CTGCATaAGg

AAAATTTATT TTGGTTGATT GTCTTCTGGT

TCATTTAATG AATCTTGTGC TTTTTCTGTT

TTTTCTACTT TTGTATTTTT TGGTTCAGTG

ACTTTATCTT TAGTAATTTT TTGACCACGT

ATGATTGCAT GAATAACAAC GAATAAAATA
TTTTCTGCAA CACTTTGAGA GAGTGATTGC

GTTTTAAATA AAATAATAAT TAAGTTTAAT

GTTTTTTGAC CATTTTTCAT TGCTTTAAAT
TTAGTTGATT CaACGAAACT AGTCaTAAAG

ATAATTGCTT TAATCAATAA AACAACAATG

CCAATCAGGT GTGTACTATC TGCATATGAT



ATTTACGAGA GAGOGACACA TTAGTTGTCT 1320 

AACTATTaAT TGAATTaTCA CAATGATTCG 1380

ACATGAACAT TTCAACGAAA GACACAATGG 1440 

TTTTCaGGAT TAGAAGTTAA TTTACTATGT 1500 

GAGCGAGAGG CCGAAAAAGC GGGCGCCCCT 1560 

AATTTTTATA TGATGAACAA ACGATAACAG 1620 

GTCACTCAAC TGTTTATCGA GTTGATTAAG 1680 

TTTAAAAGTG TGCATGTTAT AATATTTATT 1740 

TTAATAAAGG GGTGATGCCT ATGGTTATAG 1800 

TGATTTTGAA ATGCTTATGG TTGTATTAAC 1860 

AGACCATAAA AAATAaCCTT CTATTOGCTT 1920 

TAAAATATAA GGTCACCGTC TTTTTAACGG 1980 

GCCCTTTTTG TGTTTCAAGA GTTAATGATT 2040 

TATTTTAACA CATTTTTCTT ATGAATTGTA 2100 

AACTTTCCAT ACAG CAG AAA ATACAATTAA 2160 

AAAGTATTGG AGACCTCTTC ATATATAGAT 2220 

CTAaCAACTA AAGTTGAAAA ATAGAGGAAC 2280 

CCCCTAAmAA CTAAAGTTGT AAGGGGCyCT 2340 

TTATCTGAAG TCATTGTTTT TGTTGTATTA 2400 

TTTGTTTCAA CAGATGTTGT CACTTTATCA 2460 

ACAACTTTTT TATTCTTGTC AGCTGTTTTT 2520 

TTTAAGTAAT ATTGAACGAT ACCCATTAAA 2580 

ATTGTCATTA CTGTATAAAC TCTTATGATA 2640 

GTATTAAATG AAATCAAGTA ACCAACAGGT 2700 

AATAGTATTC CAATGAAAAA TTTGAAAAAT 2760 

CCATTAGCTA AGTGTGTTTT TACTTTATCG 2820 

TTGAAAATAG GTATTAACAA TAACCAAGTA 2880 

ATTATGATGC TTTGAGTTGT AATAGCAATA 2940 

GATTGTACTG AATTCATAAT CATTTCTGAT 3000
ATAGATACAA GACCGATCAG AACGCTCTTT AAATAATTAC CTTTTTTAAA TGCAATAAAT 3120
AAATCGGTAA ATTTTACTTT TTCATGACTC ATTGCTCGTT TCATAACATT TOTAATTCCG 3180
ATAAAAATTT GAACTAAAAC AAACAGTGAA ACTACAGCTG CTATAAGTAA TACAGCGATA 3240

ACTTTTAAAT ATGCATCTAC TGGTGGTTTT TGCCCAAATT GTGAATAAAT TGCAATAGTT 3300 
TGTGCATACT TAGCAAGCTG GAAATTAGCA AGTAAATATA CAACTGCAAT TACAGCAAAT 3360 


GCTATAAGTG CATATATCAA TGTTATTAAT AATTGTGGTT TAGCATTTTT AAATGCTGAT 3420 
TTAAAGTAAG TAAACAAAGT GGTGCCTCCT TTTCTTCA 3458 
(2) INFORMATION FOR SEQ ID NO: 318: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 695 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318:


GCTTGCAAGA TGCTTTCATT AAAGACAAAG TCATCGATAT TATGAATATG TTCCAAAATC 60

ACACTGATAT CACTTATACG TTGAATAAGT CTCATGCACA TATTATATGT ACGCCAGAaA 120

TATTTGCTAA ATTGTTACAT ACGATTGCAA CTAGAAATAT CGACATTCTA TCTGCCAATT 180

ATAGATCGTC ATCTATGAGC AAAGCTCGTA TATCATAATA AAAGACACAT CTATACGATG 240

ATCATTTACA ATGATTGCGT ATAGTGATGT GTATTATTTA TCTTCGGAcG 300

ATTGATCCTC AACCATTTGT TCAAATGCTT CACGAACTTG TGGTACTGTC ATACCTACTA 360

C^^CTTGGAT ACTCTTACCA CTCTTTACTA ATCCGTGTGC CATTTGTTGA TGCGTGAAAT 420

ATTCCGTATC TGCAACTTTA CTTTCATCAT AAACAGTTAA GCGTAATCTT GTTGTACAGT 480

TAGTAACATC TTTGATATTT TCTTTGCCGC CTAAACCGTC GAGGTAATAT GCAGCTTTGT 540

CTTCGTATTC ATTACCAGAT GAAGCACGTT TAGAATCTAC GCTATCACCT TTTTTATTTT 600

TGTAATCTTG CTTAGAAAAT AATTTAACTT CCTCTTCTGT TTyCTTACGT CCAGGTAATG 660

GAATATCAAA TTyCAAAATT AAGAATCTGA ATAAG 695



(2) INFORMATION FOR SEQ ID NO: 319:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 875 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319:



AATATTGaTa TTATATATAG TCGAATCaAT GATGACCCaC ATCGACCtAA TGCAAATAAT 60

AAGACAATCA AGCAGTTAAA TGACTTGTAT TGCTCGCGGA ATTTAAGATA TAATGAAATC 120

GCATTCAAGA AATTTGATAG TCATTTGCTA TCAATTTCAG AAATATATTA TGAATTGCTA 180

AATTTAATAA AAGCGAGTGA TCAGTATTAG AGAGAATAGA GCGTTAAGAC TCTATCGCCG 240

AAGtGCAAGT AATTTATTAC GAAACTCTCA GGcAAAAGGr TAATACTGTA ACGCGTTCCT 300

GAATTGGTGA TTTATAAACA GGGTAGCGAT TGCTATCCTG TTTTTATAAT TTTAAGGGGG 360

TATTTCAATG TCAAGTGATT TAAAACAAAC ACCTTTATAT CAAAATTATG TTGATAGAGG 420

TGCAAAAATT GTGGAATTCG GAGGATGGGC GATGCCTGTT CAATTTTCAA GTATTAAAGA 480

GGAGCATAAT GCTGTTCGAT ACGAAATTGG CCTGTTTGAT GTTAGTCATA TGGGTGAAAT 540

TGAAGTAACA GGTAAAGATG CTAGTCAGTT TGTGCAATAT TTATTATCAA ATGATACTGA 600

TAATTTAACT ACTTCAAAAG CATTATATAC TGCTTTATGT AATGAAGAAG GCGGTATTAT 660

TGATGATTTA GTAATATATA AATTAGCTGA CGACAATTAT TTATTAGTTG TTAATGCTGC 720

TAATACTGAA AAAGATTTTA ATTGGATTTT AAAACACAAA GAGAAATTTG ATGTTGAAGT 780

ACAAAATGTA TCAAACCAAT ATGGTCAATT AGCAATACAA GGACCAAAAG CtAGAGATTT 840

AATTAATCAA TTAGTTGATG AAGAkGTAAC TGAAA 875



(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5897 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 

TCTTTTTACG AAGGCTACCG TTTAGGATCT GATAACTTAA CTGTAGGAGA AATTGTATTT 60

AATACAGCGA TGACAGGTTA TCAAGAAACT ATTTCAGATC CATCATATAC AGGTCAGATC 120

ATTACTTTTA CGTATCCATT AATCGGTAAT TATGGTATCA ATAGAGACGA TTTTGAATCA 180

TTAGTACCTA CATTAAACGG TATTGTAGTG AAAGAAGCGA GTGCGCATCC AAGTAATTTT 240

AGACAGCAAA AGACACTTCA TGACGTTTTA GAATTGCATC AAATTCCAGG GATTGCAGGT 300

GTTGATACAA GAAGTATTAC GCGTAAAATT CGACAACACG GTGTGTTAAA AGCTGGTTTT 360

ACTGATCGAA AAGAAGATAT TGATCAACTT GTCAAACATT TACAACAAGT AGAATTACCT 420



AGTGTCGTAC TTGTAGACTT TGGTAAGAAG CAAAATATTG TTCOAGAATT AAACGTCAGA 540

GGTTGTAACG TCACAGTTGT ACCATATACA ACTACTGCCG AAGAAATTTT AGCAATGGCT 600 

CCAGATGGCG TTATGCTATC AAACGGACCA GGTAATCCTG AAGTTCTAGA ATGTGOGATT 660 

CCAATGATTC AAGGAATTTT AGGGAAAATT COGTTCTTTG GTATCTGTCT AGGACATCAA 720 

CTTTTTGCAT TATCTCAAGG AGCAAGCTCA TTTAAAATGA AGTTTGGTCA TCGTGGTGCG 780 

AACCATCCAO TTAAAAATTT AGAOACTGOA AAAGTTGATA TTACGAGTCA AAACCATGGA 840 

TATGCAATAG ATATAGATTC GTTAAAAAGT ACTGATTTAG AAGTTACTCA TCTTGCATTA 900 

AATGATGGTA CTGTAGAAGG TTTAAAACAT AAAACATTAC CAGCATTTTC TGTTCAATAC 960 

CATCCTGAAG CAAATCCAGG ACCGTCAOAT TCAAACTATC TATTTGATGA TTTTGTAGCA 1020 

ATGATGACTA ATTTTAAGGA AAAGGAGCGT CATATCAATG CCTAAACGTA ATGATATCAA 1080 

AACAATTTTA GTAATAGGGT CTGGGCCAAT TATCATAGGT CAAGCAGCTG AATTTGATTA 1140

TGCTGGAACA CAAGCATGTC TAGCTTTAAA AGAAGAGGGA TATCGAGTTA TTCTTGTAAA 1200 

TTCAAATCCA GCGACAATCA TGACTGATAA GGAAATTGCG GATAAAGTAT ATATCGAACC 1260 

GTTAACTCAT GATTTTATAG CGCGAATTAT ACGTAAAGAG CAACCTGACG CTTTACTTCC 1320

AACTTTAGGT GGTCAAACAG GTTTAAACAT GGCGATTCAA CTACAOGAAA GTGGTGTGCT 1380 

TCAAGATAAT AACGTCCAAT TATTAGGAAC TGAGCTAACA TCAATTCAAC AAGCAGAAGA 1440 

CCGTGAAATG TTTAGAACAT TAATGAATGA TTTAAACGTT CCTGTACCAG AGAGTGACAT 1500 

TGTAAATACA GTAGAGCAAG CCTTTAAATT CAAAGAGCAA GTGGGATACC CGCTAATTGT 1560 

TAGACCGGCA TTTACGATGG GTGGTACCGG AGgCGGTATT TGTCATAATG ATGAAGAATT 1620 

ACATGAAATC GTCTCAAATG GTCTTCATTA TAGTCCAGCA ACGCAATGTT TATTAGAAAA 1680 

ATCTATCGCA GGTTTTAAAG AAATCGAATA CGAgTAATgC GTGaTAAAAA CGATAATGCC 1740 

ATCGTTGTAT GTAACATGGA AAATATTGAT CCAGTTGGTA TTCATACAGG CGATTCAATT 1800 

GTTGTGGCTC CTAGTCAAAC ATTATCAGAT GTTGAGTATC AAATGTTACG TGATGTTTCA 1860 

TTAAAAGTTA TTCGAGCTTT AGGTATCGAA GGTGGTTGTA ATGTTCAATT AGCATTAGAT 1920 

CCCCATTCAT TCGATTATTA TATTATAGAA GTAAATCCGC GTGTATCACG TTCATCAGOG 1980

TTAgCTTCAA AAGCAACAGG ATATCCTATT GCAAAATTAG CTGCTAAAAT CGCGGTTGGT 2040 

CTAACATTAG ATGAAATGTT AAATCCAATT ACAGGAACAT CTTATGCAGC GTTTGAACCA 2100 

ACTTTAGACT ATGTGATTTC AAAAATACCA AGATTTCCTT TTGATAAATT TGAAAAAGGA 2160

GAACGAGAGC TTGGCACACA AATGAAAGCA ACAGGTGAAG TTATGGCCAT TGGTCGAACT 2220 



TTACCAAATG GTGAAAGCTT CGATCTTGAT TATATTAAAG AACGTATTTC ACACCAAGAT 2340

GATGAACGAT TATTTTTCAT CGGCGAACAA TTAGAAGAGG CACAACATTA GAAGAAATTC 2400 

ATAATATGAC TCAGATTGAT TACTTCTTCT TACACAAGTT CCAAAACATT ATTGATATTG 2460 

AGCATCAATT AAAAGAGCAT CAAGGTGATT TAGAATATCT TAAATATGCA AAAGATTATG 2520 

GATTTAGTGA TAAAACAATA GCGCATCGCT TTAATATGAC GGAAGAAGAA GTATATCAAT 2580 

TGCGTATGGA AAATGATATT AAACCTGTTT ACAAGATGGT TGATACTTGC GCAgCTGAAT 2640 

TTGAATCTTC AACACCATAT TATTATGGTA CATACGAAAC TGAAAATGAA TCCATAGTTA 2700 

CTGACAAAGA AAAAATCTTA GTATTAGGCT CTGGACCAAT TCGAATCGGC CAAGGTGTAG 2760

AATTTGACTA TGCGACAGTT CACGCCGTTT GGGCAATTCA AAAAGCAGGG TACGAAGCGA 2820

TAATTGTGAA TAACAATCCA GAAACAGTTT CAACAGACTT CTCAATTTCT GACAAATTAT 2880

ACTTTGAACC TTTAACTGAA GAAGATGTGA TGAATATCAT TAATTTAGAA AAACCTAAAG 2940

GTGTCGTTGT ACAATTTGGA GGACAAACAG CGATTAATTT AGCAGACAAA TTGGCTAAAC 3000

ATGGTGTTAA AATACTTGGT ACTTCACTAG AAAATCTAAA TCGTGCTGAA GATAGAAAAG 3060

AATTTGAAGC ACTATTAAGA AAAATTAACG TGCCACAGCC ACAAGGGAAA ACAGCTACAT 3120

CACCTGAGGA AGCATTAGCG AATGCTGCAG AAATCGGATA TCCGGTTGTA GTAAGACCTT 3180

CTTATGTATT AGGTGGTCGC GCAATGGAAA TTGTAGACAA TGACAAAGAG TTAGAAAACT 3240

ATATGACCCA GGCTGTAAAA GCGAGTCCGG AACATCCGGT ACTAGTCGAT AGATATTTAA 3300

CTGGTAAAGA AATTGAAGTT GATGCGATTT GTGATGGAGA AACGGTCATT ATTCCAGGAA 3360 

TCATGGAACA TATTGAACGT GCTGGTGTGC ATAGTGGTGA GTCAATCGCT GTATATCCAC 3420 

CACAAACTTT GACAGAAGAC GAGTTAGCAA CACTTGAGGA CTATACTATA AAATTAGCTA 3480 

AAGGTTTAAA CATCATTGGC TTAATCAACA TTCAATTCGT TATAGCTCAC GATGQTGTGT 3540 

ATGTTTTAGA AGTAAATCCA CGTTCTAGTA GAACGGTACC ATTCTTAAGT AAAATTACTG 3600 

ATATTCCAAT GGCACAATTA GCTATGCGAG CAATCATTGG GGAAAAACTA ACAGATATGG 3660 

GTTATCAAGA AGGGGTTCAA CCATATGCTG AGGGTGTCTT TGTGAAAGCA CCAGTATTTA 3720 

GTTTTAATAA ATTGAAAAAT GTTGATATTA CTTTAGGACC TGAAATGAAG TCAACAGGTG 3780

AAGTGATGGG GAAAGATACT ACATTAGAAA AGGCGTTATT CAAAGGGTTA ACAGGTAGTG 3840 

GCGTTGAAGT TAAAGATCAC GGTACAGTAT TAATGACCGT CAGTGACAAA GATAAAGAGG 3900 

AAGTTGTTAA ATTGGCACAA CGCTTAAATG AAGTTGGCTA TAAAATTTTA GCAACGTCTG 3960

GAAcAGCTAA TAAATTAGCT GAGTATGACA TACCTGCAGA AGTAGTAGGC AAAATTGGTG 4020 

TGACTAAAGG TAAAGAAOTA GAAAGGGATG GCTTCCAAAT TAGACGTACT ACAGTTGAAA 4140

ATGGTATTCC ATGTTTGACA TCTTTAGATA CAGCTAATGC CTTAACGAAT GTAATTGAAA 4200

GTATGACATT TACAATGCGT CAAATGTAAA TCAATCAAAC TGTATCGGTG GGGCTGTAAT 4260

TAACCATTTA CTTAAAGAAG TTTATATTAC AGCCTCATTA TTTTAATGAA TTTCTTAATA 4320

TAAAGGGAGA CaTATATGAT GAAAGATTTA CCAATTATTG CATTAGATTT TGAATCAAAA 4380

GAAAAAGTAA ATCAATTTTT AGATTTATTT GATGAATCAT TATTCGTAAA AGTAGGTATG 4440

GAACTTTTTT ATCAAGAAGG TCCTCAATTA ATTAATGAGA TAAAAGAAAG AGGCCATGAT 4500

GTATTTTTAG ATTTAAAACT GCATGATATT CCTAATACAG TTGGTAAGGC GATGGAAGGA 4560

CTAGCTAAAT TGAATGTTGA TCTGGTAAAT GTTCATGCTG CTGGTGGCGT AAAAATGATG 4620

TCTGAGGCCA TTAAAGGATT AAGAAAACAT AATCAAGATA CAAAAATTAT TGCAGTAACA 4680

CAGCTTACGT CAACAACAGA AGACATGTTA CGACACGAAC AAAATATACA AACATCGATT 4740

GAAGAGGCCG TTTTAAATTA TGCCAAGTTA GCAAATGCAG CTGGTTTAGA TGGCGTTGTT 4800

TGTTCACCTC TTGAAAGTCG TATGTTGACT GAAAAGTTAG GTACATCATT TTTAAAAGTA 4860

ACACCAGGTA TTAGACCTAA AGGTGCATCT CAAAATGACC AACACCGTAT TACGACACCG 4920

GAAGAAGCAA GACAGCTTGG TTCGACGCAT ATTGTAGTCG GTAGACCGAT TACACAAAGT 4980

GACAATCCAG TCGAAAGTTA TCATAAAATT AAAGAAAGTT GGTTAGTATA ATGGCTAAAG 5040

AAATTGCAAA ATCATTATTA GATATTGAAG CTGTAACATT ATCACCAAAT GATTTATATA 5100

CATGGAGTTC AGGTATTAAA TCACCGATTT ACTGTGATAA CCGTGTTACG TTAGGTTATC 5160

CTTTAGTTCG AGGCGCAATC CGCGATGGTT TAATTAACTT AATTAAAGAA CACTTTCCTG 5220

AAGTAGAAGT TATTTCTGGT ACTGCAACAG CTGnTATTCC ACATGCAGCT TTTATTGCTG 5280

AAAAATTAAA ATTACCAATG AATTATGTTC GTTCATCAAA TAAGAGTCAT GGTAAGCAAA 5340

ATCAAATCGA AGGTGCTAAA AGTGAAGGTA AAAAAGTAGT TGTGATAGAA GATTTAATTT 5400

CGACAGGGGG ATCTTCAGTC ACAGCAGTTG AAGCCTTAAA ACTAGCAGOT GCAGAAGTAT 5460

TAGGTGTTGT AGCTATCTTT ACTTACGGTT AGATGATACA TTTAGCAATA 5520

TTCAACTACC TTTTTACACT TTAAGTGATT ACAATGAATT AATTGAAGTA GCTGAmAmTG 5580

AAGGTAAAAT TTCTAGTGAA GATATCCAAA CATTAGTTGA ATGGAGAGAC AACTTAGCAT 5640

AATATAGACA CTAGAAGGAG GAATTCAACA AATGAATGAC AAAACATCTA ATGATTTATA 5700

TGGGAAGATA AAACATTGTA ACGAATTTAT CAATCATTCA AATGATTCCA ATCTATCTAG 5760

TAGTCACGAT GTCGACGAAA GTTCAACGAA GCAAAAAGAT ATAAAAAATA AAACAACTAT 5820



TTAAAAACAA AAAAGCT 5897 
(2) INFORMATION FOR SEQ ID NO: 321: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7965 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321:

TCATTCTAAA TCAACTTATT TTCCATTGCA TAAATTGCTG CTTGTGTACG ATCGCTAACT 60

TGTAATTTAC TAAATATATG ACTGACATGT GTTTTAATTG TTTTTTCAGA TACAAATAAA 120 

GTTTCTGCAA TCTCTTTATT TGTTTTACCT TTAACCATTT CACGTAACAC TTCAATTTCT 180 

CTCTTTGACA ACTTATTCGT GTAGTGTGGT TTTTGGCTAA CTGTTTCGAA TACATCTTGT 240

GCCTTAGGAT GTATCATTTT TTCACCGTTC ATAACTCGTC TAATAGTTTC AATTAATTGC 300 

TGAGGCTCAA CGTCTTTCAT TTCATAACCA TCAGCACCTT TATTGATTGC TGAAATTACA 360 

TGTTCATCAT CAACATAACT TGTTAATACC AAAACTTTAA TATCCGGATA ATGTGCCTTA 420

ATATATTCCG TAATTTCAAT ACCATTCATG CCAGGCATCA CTAAATCTAA TAGCACAATA 480 

TCAGGGTGCT CATGCTCTTT TAAATATTCT AAAAATGTTT CTCCATCTGC AAAGTCTTGT 540 

AAAACTTCTA TGTTTTCAaT CGTGGaTAAT AAAAATCGCA ATCCTTGtCG CACAATATAA 600

kGGwCATCTA CTAATATnAC TTTGTTCATG GGTTATCTCC TTAAAtCAAG CTATTTTATA 660 

GGAATTGTGA ATTGTATTTG TGTACCCTTT GTTGGCTGAG AATGAAAGGT CACTTTACCT 720

CTTAATAATT TAACTCTTTG TTTTATGTTA TTAATACCGT GTGATGAAGC TATCTGAACA 780 

TTATOGATCT CAAATCCTTG ACCATAATCA ATCACGTCAA TATATAGTAT ATCGTTCATT 840 

TGTTTTAATG TAAGATCCAT TTTATTCGTA TCAGCATGTT TCTTAACATT ATTAATACAC 900 

TCTTGTAATG CTCTGTATAT GTTTTCTTCG ATTTCATTAG ATAAATCGAT TAAACCTTCT 960 

ACATTTACAT TTAATTGTAT ATGCATTAAT TTACTATATG CTGTCAAAGC ATGAATTAAA 1020 

CCTTGCTCAA GTCCAACTGG CTTAAGTnGC CAAATCAATG CACGCATTTC ATTAACGGCA 1080

TTTTGACTCG TTtCCTCAAT CGTCTTGAAT GCTTGTTTAG CGATGGATTC GTTTGACATG 1140 

CCATACGCAG CATGTGCtGT TAGTTTTACA GAAAATAACA TTTGATTTAC TGAATCATGT 1200 

AAATCTCTAG CTAAACGATT ACGTTCATTA ATTTTTGCCG CTTCTTTTTC ACGGTCTGTT 1260

AAATAAATAC GTTTGATGGC TGACCCTAAT TGAAATGCGA CAGACTCTAG CAACTCTAAA 1320 
TGACCCGATT TAAGTGGTAG CGTTGCATGA TGTGTAATAT TGTCATTTTG GCTAGGAAAT 1440

GCTTTAGAGG CTAAGTTAAT ACGAGAACAA TTGACGATAT TCGACGCTTT CATTAGCCTA 1500

CGTTGATTAA ATGCTTTCAC ACACCAACAA GACCCATCTT TAATATAGTG ACAGTGGTCT 1560

GCTGTCAAAG ATTGTGGTAG AGCCACATGT GATACAAGTT CATGTTCACC TACGCTATTG 1620

ATGAAAAATA TCCAGCCTGT CGTGAAATTA CTGCCCTCAA TTAAATATTT TAACGCACCT 1680

TGGGTCATGC TATACATTTC rGTTTCTTCG TTTAAAAATT CGGCAATCTC TTTTAATAAA 1740

GCTAGTCGCG TCCTTTGTTC CATCAAATCG CTCCAATTCA TTTTTACGTG TATTAACTAT 1800

TATACATTGA GTTATTATAT TTTTAAATCT TAGACGTAAA CATGATAAAA TGGCCTTGAT 1860

TACTCAATAG TTATATTTCG GAGAACTGAT TTGTGATATG ATATTAAAGA CTATAGGAGG 1920

ATTTTATGAA ATTTAAAATA CCAGAAAACT TTAATGACTT AAGTTTACGA GATATTTTCC 1980

AACAACTTAA GGTACCTAAA AAAGATTTAC ATCATTTAAA TATGTCTAAA GATATTACTA 2040

TTAATGATAA ACCTGCGCGA TTAATGGATA AAGTGCATAC TGGCGACGAT GTATTTGTTC 2100

CAACCATCGA TGAAAAAAGT AATTATGTTC CAAGTTATCG TTATGCACAA ATTAAATACG 2160

AAGACGATGA TATGGCAATC GTAATGAAAC CTAAAGGTGT TAAGACTCAC CCTAATGATT 2220

TAAAAGAAAG CAATACTTTA ATGAATCATG TGATTTACAC TATTGATAGT GACTATGTCG 2280

AACCAATTCA TCGACTGGAC CAGGAAACAG TAGGATTATT AATTGTTGCT AAAAATCCTT 2340

TAATGAAAAA AATTCTTGAT CGCATGTTAG AAGACAATGA TATTACGCGG ATATACAAAG 2400

CAAATGTTAA GGCACTTTTA CCTTTAAAAC CACAAACGAT TGATATGCCA ATTGGTAAAG 2460

ATAAATTCCA TTCGAATAAA CGACGTGTGT CTCCTACTGG ACAGCGTGCA ATTACACACA 2520

TTTTAACTTC AAAAATGATA AAAGAAGCTG TGTGCCmACT TGAAATCAAG TTGGATACTG 2580

GACGTACTCA TCAAATwCGT GTGCATTTAG CTGAAATTGG TCACCCTGTT ATTGGTGATC 2640

CTTTATATGG TGATTCAACG TTAAGACAAT TAGAACTTGA AAGTTACAAA ATAGAGTTTG 2700

TGcATCCCTT GACTAAGGAA GTCATTTCCG TTTCTTTGGA TGACTAATTT GATTAGTTTT 2760

GCATGATATT tAAACATGCA ATACCGCATT GTAACTAAAT CAAGTATCAA CtTAAAcGGA 2820

TAGATGGAAA ATTATTAATT TTTTCAGATG TTCGGTTTTT TTGTTTTTTA CGATGCTTAG 2880

GATTTTATAT TTTGATATTT TAGTAATTAT TCATTTTATA ACATCCTTGG ATAATGACTT 2940

GTAGTCTTTT TCAACTGCGT TACGTGTATC TATGGACAAT ACATGACATC ATAAGATTTT 3000

TATCACAGGT TGTTTGGCCA ATACATGTAC AACAATTCAT CATATAAAAA ATAGGTTCTA 3060

TAATAAAACG GACTCCATGA AAAGTTTTTC CTTTTCATGG CTCTATATCA AATCAGACTT 3120
CCAAACTTTA TTTTATATTA ATATTTAATT AATGAGGATC TACCATATCT TCTGGTTTAA 3240

TCCATGCTTC AAATTGTTCT TCTGTAACAT ATCCAGTTTG AATTGCAGAT TCTTTTAAAG 3300

TTAAACCTTC TTTATGGGCT TTCTTAGCAA TTTGAGCTGC TTTTTCATAA CCAATATGTG 3360

GATTTAATGC AGTAACTAAC ATTAATGATT GATTTAAATA ATTATCAATA TTCTCTTCGA 3420

TTGGTTCAAT GCCCACTGCA CAATTGTTAT TAAATGTTTC CATACCATCA GCTAAAAGAT 3480

AAATTGATTG TAGTGTATTA TGCATAATAA CTGGTTTATA AACATTCAAT TCAAAGTTAC 3540

CTTGTGAACT TGCGAACCAA CAACTGTATC ATTACCCATT ACTTGGACTG CnACCATTGT 3600

TAACATTTCA CATTGTGTAG GATTAACTTT ACCAGGCATA ATTGATGAAC CTGGTTCATT 3660

TTCAGGGATA GAAATTTCTG CCAAACCAGC TCGTGGCCCT GAAGCCAACC ATCTCACATC 3720

ATTAGCAATT TTCATTAAGT CTCCTGCTAA TGCCTTCAAT GTTCCATGCA ATTGAACAAC 3780

TTCATCATGC GCTGTAAGTG CGTGGAATTT ATTTTCAGAA GATACAAATG GATAACCCGT 3840

ATTTTCTGAA ATATAATGTG CCACTTTATC ACCAAATTCA GGATGCGCAT TAATACCAGT 3900

ACCAACAGCC GTACCACCGA TGGCAAGATT TAAAATGTGC TTCTTAGATT CAGATAACAT 3960

TGTTTCACAA CGGTCAAGCA TATAACGCCA GCCACTAATC TCTTGTCCTA GTTTGATCGG 4020

CGTTGCATCT TGTAAATGTG TACGACCAAT TTTAATAATT GAATCAAATT TATCTTCTkT 4080

TTCTTTCAAA GTATTTCTTA AAAGTTTTAA TGCAGGTTCT AATTTTUTTT CAACCTCTTG 4140

ATATAATGCA ACGTGCATAG CAGTTGGGaA TGTATCaTTC GaACTTTGAG ATTyATTTAC 4200

ATCATCATTk GGGTGGATAC TTTCATCACT TTGATGATCT TTTAAATACA TATTAGCAAC 4260

ATAACTTACT ACTTCGTTCA CATTCATATT ACTTTGTGTA CCGCTTCCTG TTTGCCATAC 4320

AACTA6TGGG AAGTGTTCAT CTAATTCACC TGATAAAATT TGATCACATG CGTATACAAT 4380

GGCATCTTTC TTTGCCTCGC TTAATTTTCC TAAATCAAAA TTAGCTATTG CTGCTGCACG 4440

CTTTAGTTGT GCAAAACCAT AAACTACTTC GATTGGCATA CGCTCTTTAC CAACTGGGAA 4500

ATTACGTTTA CTTCTTTCTG TTTGAGCACC CCAATATTTA TCTGCAGGTA CTTCTATTTC 4560

TCCAAAAGTA TCATGTTCAA TTCTTACTGA CATTCAATTT CTCCCCTTAT CACTGTTTAT 4620

TTAACTGTAG TATATCATTA AATAATTTAA TTGAGCAATT TATGATTAAA ACGTTTTCAT 4680

AATTTGAAAT AAAAATACAC TAATCGCACG TGTTCACCCT TTATTACAGT GATACGGTCA 4740

TACGATTAGT GTGTTATCTA TCATTATTTA GTTATTATTG AACTAAGTTT AATTACGATA 4800

GTAGCTTCAA CCGtAGCAAT AGCTGTAAGT ATATATAATA CAGCACTAAC 4860

AATTGTCGTA TATGGATTTA GAGCAACAAG CGTaCCTAAA ACTCCTGTTA AACTCGCATA 4920
TACAATACCT GATTGATTAC TTTTAATGAA TGTTTOCGCA TTAACATCAT CAATTAATCC 5040

TTTTGATAAA TTGAGTTGTA ATTTTATTAC TTTGAAAATA ACAGGTAAAT ATAATGCCCC 5100 

AATTGCCAAT GGAAAAGCTT TAATTGATAT TAAACTTATA ATAACTGTTG CTATCAATAA 5160 

TTGAATCCAG TATTTTCCTA ACATAAATAT ATAAATCTCC TCTAATTTCA TTCTTCAATA 5220 

GCATATCATA ATCTTGGCAT ATTAAGAAAC GCGGTTTAAT GATTTCATTA AAAATATTAC 5280 

TGATAGATGA CTTCTTTCAA TTATGTCTGG AGTAATTAAT TATCAATTCC GTTTAAATGG 5340

TGTTTTAATA TTTAAAATTG AACTTTTGAT ATATTACTAT GTCTGGTACA CAAATCAATG 5400

TTTTATGCTT TACAAAGTTA TATTGGCAGT AGTTGACTGC AGTCCACAAC ATAGAGGCTT 5460

CGGAATGTCA GCTTCTATTT CATGCAAGTT GGTGGAGCTC CAACATAGTG GAATTGGATT 5520 

CCCAATTTCT ACAGACATTG CAAATTGGGG AAACGGGCCA CAAACTCAGA AACTGGTGGA 5580 

AAGTCAGCTT AAAATAACAT GCAAGTTGGC GAGGCCCCAA AATAGTGAGA TCGGATTTCT 5640

AATTTCTACA GACATTGCAA ATTAGGGAAA CGGGCCACAA ACTCAGAAGT TGGTGGAAAG 5700 

TCAGCTTAAA ATAACATGCA AGTTGGCGGG GCCACAACAT AGAAAAATTG GATCCTCAAT 5760 

TTCTACAAAC AATGTAAGTT GGGGAAACAG CCCCAACACT GAAACTAGCA GAAAGTCAGC 5820

TTCTATGAAT ATAATAAAAA AGCTAGGTAA CAAAATGCTA CCTAACTTCA TATTCAAGAT 5880

AATCAATCCT ATTTGATATA TGTTCTATAC TATACATTAT TTACATGATA AATAACTGAA 5940

TATTACACAA TTATAATACT TTACTGACTG TCTTCTTCAG AATTCTTTTC TTGATCATTT 6000 

TGATCAGAGA TTTGTTCCaT TTCTTTACCT AATTCTTTTA AATCTTCAAA ATCCGTTACC 6060 

ATACTGTTTT CTTCTTCATG ATAATTTAAT TTTGGATCTT TGTCTTTAGA CATAATCAAT 6120 

ACCTCACAGA TTTTTAATTA AGCAAAGCGT GACGTAAAGT AAGCTTTAAC ATCTTCAGGT 6180

AAACCTGCAG CCGCTTCTTT ATCAAGAATA ACATTTACCA TTCTATGTGC TTTTAAATCG 6240 

GCTGGTTOGA AGCTTGTTTT ACCATTTTCT TGATATAATT TTTCAACTAC ATCTCGTTTA 6300 

TTAGCACCTG TCACTACTAA GAAAATTTCT CTTGCTTCCA TTAGTCCTTG ACGAATACTA 6360 

ACATTTAACT TACCTTGCTC ATCGATAGAA ACAACTTGTA ATGTTAATTT CCCTTTATTT 6420 

TCTTTAGTTT TAATCTTATC AGCGATTAAT TCGATTGCAT CTTTTTCATA AGCAATTGGA 6480

TAAACTTGAC CTGCTGGTAC ACCTAACGCT TCGAAATATG ATTTTTTATC GTCATAATCT 6540 

AAAATATTTA TTTGGCTAAA ATCAACAGCA TGTTTTTCAA CATTTTTCTT TAATTCATCT 6600 

AGAACTGGCG CTTGATCTGT ATCTAAATGA AAACCTGCAA TTGTAGTAGG ATTATTGTTA 6660

AATTGCTTTC TAATAATATC AGCAGCATAT TCTGCTACAA GTTGACTATT GTCAAAGACT 6720 
GATATTAATA ATTATTATAC CCTAACTTTC AATATATCAA ACCATTTAAC TTTAACATGC 6840

TTATACTCTA AATATAGCAC TTAAGCATCA TTTTTATAAT GAAAATGAGT AAATTTTAAT 6900

TCAATCCCGG rAAATCTTGT TGACGTAACG CTTCATAAAT TAACAACGCA GCAGTATTTG 6960

ATAAATTTAA TGAACGAATA TGTTCACTCA TAGGAATTCT TAACGCTGTG TCTTGATATT 7020

TCTCTTTCAC CCAGTCTGGT AATCCTGTCG TTTCTTTTCC AAAAATGAAG TAAAAATCTT 7080

TGTCATGATT TGAAAAATCA AAATCACTAT AAGTCTTTTT ACCAAATTTT GTTAATAAGT 7140

AATACTCGCC ATTTGTGACT TCAAAAAATG CTTCAATACT ATCATGATAC GTAATATTCA 7200

CAAATTCCCA ATAATCTAAA CCGGCTCTTT TTAACATTTT ATCATCAGTT CTAAATCCAA 7260

GAGGTTTAAT TAAATGTAAA TGTGTGTTTG TACCTGCACA CGTACGgCAA TGTTACCAGT 7320

ATTAGCTGGG ATTTCTGGTT GATATAAAAC GATATGATTT GTCATATTAC TATTCTCTCC 7380

TTGTGTCTAA TCCTTTTATC ATTTCATTCT GAACTTCTGC ATCCTCTTGA TCATAATTAG 7440

CATTGATAAA ATCTCTTGCT TCTTCCCCAA GAATTTGACC AATGGCCCAA TAAGCAGTTG 7500

CTCGAATCAA CGGTCTTTCA TCTGTTGTTG caact*itttt CAATTCTGGA ATTGCATCCA 7560

CTTCATTAAA ATGCGCCAAT GCTAAAATAG CATTTCGTTG TATCGGCTTT TTACCACGCC 7620

AAGCACCTGC AAGgTGACCA TATGTTTGTT TGAATTCTTT ATTAGACATA CGTAGtAAAG 7680

GTACTAATCT TGGCTTTAAA ATTTCTGGTT CCAAAATGAT GTCATCTTGT TCGGTATTAA 7740

TACCTCTATT TTTCGGACAA ACTTGTTGAC ACGTATCGCA ACCATATAAT CTATTCCCAA 7800

TTTTATAACG ATATTGGTCA GGCATATAGC CTTTTGTTTG CGTTAAAAAA CTAATGCATT 7860

TCTGACTATT TAATTGGCCA TTTCCAACTA ATGCACTTGT TGGACAACGA TCAACACAAA 7920

TTGTACAATG CACCACAGCT ATCTAATAAT GGATCATCAG GTTCC 7965



(2) INFORMATION FOR SEQ ID NO: 322:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1302 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322:
GCCCTGTTGG AGAAATCACC TTTATACGAT GGTTGAAAAA GCATCATTAG GTACAATCGT 60 
GGCAATTATA GTTATTTTGC TGTTTTTAAG AAACATTCGT ACGACGGCAA TTTCTATTAT 120

ATCGATTCCG TTATCACTTC TTATGGCGCT TATTGCTCTG AAATTGAGTG ATGTTTCATT 180 




EP0 786 519 A2 



TGTAGTTGTT GAAAATATTT ATCGACGCTT AACAGATTCA GAAGAACAAC TAAAAGGTGA 300

AAATTTAATT ATCAGTGCGA CAACTGAAGT ATTTAAACCA ATAATGTCAT CGACACTAGT 360

TACTATTATC GTCTTCTTAC CACTTGTGTT TGTATCAGGT TCAGTAGGCG AAATGTTTAG 420

ACCTTTTGCA TTGGCTATTG CATTTAGTTT ATTAGCATCG TTATTAGTGT CAATTACACT 480

CGTTCCAGCG TTGGCAGCTA CACTATTTAA AAAAGGCGTT AAACGTCGTA ATAAACAACA 540

TCAAGAAGGA TTAGGTGTTG TTAGTACAAC TTATAAAAAA GTATTACATT GGTCATTAAA 600

TCATAAGTGG ATTGTAATTA TATTAAGTAC ATTAATTTTG GTTGCAACTA TTGTATTTGG 660

AGGACCOAGA CTAGGCACTA GCTTTATTTC AGCAGGTGAC GATAAATTTT TAGCTATTAC 720

TTATAPApCt; AAGCCTGGTG AAACGGAGCA AGCAGTGTTG AATCATGCGA AAGATGTTGA 780

AGCATGTAAA AACAATTCAA TACTCAGTTG GCGGTAGTAG 840


TfVAfrPArsAT 


r'PAAfYWVTA 


G T A. CAAAT AG 


TATGGCAATC 


ATGGTTGAAT 


ATGATAATGA 


900 




^p^^n^ffi n ■ I *■ .i* ii • r 

X X lV3nl\91nU 


A ACS PQC5 AT AA 


fi^yPT A H w P AAA 


CATGCAGATG 


GCTTTAAACA 


960 


TCCTGGAGAG TGGAAAAATC AAGATTTAGG AACAGGTGCA GGTAATAAAT CTGTAGAGGT 1020

TACTCTAAAA GGTCCATCAA TGGATGCCAT AAAATCAACT GTAAAAGATA TTGAACAGAA 1080

AATGAAACAG GTTAAAGGAC TAGCCAATGT CAAATCTGAT TTATCGCAAA CATATGATCA 1140

GTATGAAATT AAAGTCGATC AAAATAAAGC GGCAGAAAAT GGTATTTCTG CAAGTCAACT 1200

TGCAATGCAC TTGAATGAAA ACTTACCAGA AAAAACAGTT ACGACTGTTA AAGAAAATGG 1260

TAAAACTGTT GATGTTAAAG TCAAACAAAA TAAGCAAACA GG 1302



(2) INFORMATION FOR SEQ ID NO: 323:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1003 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323:
ATATATATTA ATTTAAACGT GTTTCACATG TACCAGTGTT AATGACAGAT AATGCTGCGT 60

TTAAACCACC TTCAACAAGG TTTTGTACTG CTTCATCAGA GAAGAAAGCA ATATGTGGTG 120

TTACTAAAAT TCTTTCATGT TCGATTAACT CTAATAATGT TTTATCGTCA ATGTCTTTAT 180

TAGTCCAGTC ATTTGTGAAG TATGCTGCTT CATTTTCATA AGTATCAATC GCAGCACCTA 240

ACAAAGTACC ATCGTTCACT GCAGCGATTA AATCAGGTGT ATTGATGACT GCACCACGTG 300
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TTTGTTCGCT 


GGAACATGTA AAGAAATAAT 


ATCGGCATCT 


TTAATAOCTT 


420 


CTTTAACACT 


ATCTTTATAA 


GTTAAAAAGT 


CTAAATCTTT 

^» *«win4 w a a a 


ATTAGGATAG 


GCGTCATAAG 


480


CTGTAATTGT 


AGCACCAAAT


CCTGCATATA 


A *- A A VJ A 


AGCAGCACCG 


ATACGACCCG 


540


TACCAATAAT 


TGCAACAGTC 


ATATTTTTAA 


CTGGTTTAGA

w ivv A A inun 


CATGATTTCT 


GCTTGCCAAG 


600 


TAAAATCATG 


TGCTTGTACA 


CGGOGTTCAA 


A^% A V. AVTVTVXnn 


GCGACGCACT 


AATTGTAGGG 


660 


CGATAQArAC AGAATACTCT GCAATtGTTT CAGGTGAATA ACTAGGAACG TTAGATATCA 720

CAATATTGTG TTTTTTAGCT AAATCTAAAT CATACATATC AAATCCAGCA GTACGTTGTG 780

CAAHTOm 1 AATACCGTAA GATTCTAATT TAGGATAAAC GTCATTTTCT AACTTACCAA 840

ATTGCATTGT AGTTACGCCA TCGTAATCTT TTAATTGAtC GACTGTAGCA CTTGATAATA 900

GCTCTTTAGA AGTAGTTACT TCGACATTAT TCTtTtTCCC CCAATTTAAT GCCATCTCTT 960

TCTCATAATC ACGCGTaCCA AAGAACATAA TTCTCGTCAT TAT 1003



(2) INFORMATION FOR SEQ ID NO: 324:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5030 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324:



GGCTTTTATA GTGTTTACGA TTATTATGAT TACTTATATT CACGCTGTTA AAAATTATCC 60

AAATAACCGT ACTGTTCATT ATGGTTATAC AGCTGCATTT ATACTTGTTA TATTACAAGT 120

TATCACAGGT GCATTATCTA TTATGACAAA TGTTAACCTG ATAATCGCAC TTTTCCATGC 180

ATTATTTATC ACTTATTTAT TTGGTATGAC AACATACTTT ATCATGCTAA TGTTACGATC 240

AGTAAGAAGT GACArGCAAT AACaAAAAAG cmAACCGTAA TTTTAATGGC ACGCCCATTA 300

AAATTACGGT nTTTTATATC AATATTTAAA AtTAAACCtA AGCCATGTAA AAACGAGATT 360

ACACGTCAAT TGTTGTGTAA TCTCGTTTTA TnTTAATCAT TTTAGTCAGT TGCTTTTTCA 420

ATTTCGATTA ATAAATCGCC TGTCGCTATT GTGTCACCAT TATTTACAGT TACTTGTTTA 480

ATCACACCGT CAAATGGTGC TTGAATTGTT GTTTCCATTT TCATAGCTTC AGTAATTAGC 540

AACGGCTGAT TAGCTTTCAC AGTTTCACCT ACACTAACCT TGACTTCAGT TACTGAACCT 600

GGCATTTGAG CACCGATATG ACTTGGATTA CTCTTATCTG CTTTTGGCTT AACGTTCGCA 660

TTTGTATGCA CATTTTCATC TTTAATGTAA ATACGTCTCG CTTGACCATT CATCGCATAG 720
CGTTTACCTT TATCGATTTC GATTTCTACT GTTTCACCAT TACGCATTCC AAAGAAGAAT 840

GTAGGCGTAT CAAGTAACGA TAAGTTTCCG TATTGATTTC TAGTTTGAAT ATATTGTTCA 900 

TATACTTTTG GATATAGTAC ATAACTAATA ATATCTTGCT CCGTAACAGG ACCTTGTTGC 960 

TCTTCTTCAA GCAACTCACG GACTTTTTCA AAATCAACTG GCTCTAGATA TTCACCTGGA 1020 

CGAGCTGTTA GTGCTTCTTG GCCTTTTAAA ATAACCGCTT GTAAATCTTT ATTAAAACCA 1080 

TTTACAGGTT GTCCTATTTC ACCTTTGAAG AACGACACTA CTGATTCTGG GAAATCTAAT 1140 

TTATAGCCAT CTGTAATCAC GGATTGTTCA TCAAGATCAT TTTGTACCAT ATAAAGTGCC 1200

ATATCACCAA CTACTTTAGA CGATGGTGTT ACTTTTACGA TATCACCAAA TAAGAAATTC 1260 

ACTCTGCGAT ACATATCTTT GACTTCATCA AATCTTTCGC CTAAACCTAA ACTTTTAGCT 1320 

TGTTGACTTA AATTCGAATA CTGTCCACCA GGCATTTCAT GTTGATAAAT TTCAGTATTC 1380 

GGTGATTTGA TATCACTTTc AAAGTCTGAA TAATAAGTAC GTACAGTTGA CCAATAATGA 1440

CTAAGTGACT CCATACCTTC AATATCAGTT CTAAGGTGGC GTGGGAAGCC ATTTAATGCA 1500 

TAATATAACG AATTGGCGCT TGGCTGACTT GTTAAACCAC TCATTGAAGC AACAGCAGTA 1560 

TCAATGATAT CGACACCAGC ATCTATTGCT TGTTTGTATG TTAATAAACC ATTACCACTT 1620 

GTATCATGAG TGTGAAGATG AATTGGTAAA TCTACAGCTG ATTTTAACTC ACCAATCAAT 1680 

TCGTAAGCGG CTTTAGGTTT TAATAAGCCT GCCATATCTT TAATCGCTAA AATATGGAAA 1740

CCTTCACGTT CTAACTCTTT AGCTAGTTTG ACATAATACT CTAAAGTATA AATGTTTGAT 1800

CGCTCAGGAT TTAAAATGTC ACCTGTATAA CAAATAGTAC CTTCTGAGAT TTTGCCCGCT 1860

TCTTGTACTG CTTCATTGGC AACTTTCATT TGATCTACCC AGTTTAATGA ATCGAAAATT 1920

CTAAAGACAT CTATGCCTGC TTTAGCACTT TCTTGTACGA ATTTATGAAT AACATTATCA 1980 

GGATAGTTTT TATAACCAAC TGCGTTTGAA GCACGTAACA ACATTTGGAA TAATACATTT 2040 

GGAATAGCTT TACGTAGACG TTCAAGTCGT TCCCATGGGT TTTCCTTCAA GAAATTATAT 2100 

GCCACATCAA ATGTAGCACC GCCCCACATT TCTAGTGAGA AACCATCTTT AAATACGTCC 2160

GCTGTTTTGG ATGCGATATT AATCATATCC TTAGTTCTAA CTCGTGTAGC TAATAATGAT 2220 

TGGTGTGCAT CTCTAAAGGT TGTATCTGTT AGTAAGACAT CATCCTGCTT TTTAACCCAT 2280

TCAGCTACAC CTTTTGGACC TACTTCATCA AGCAATTGTT TCGTACCACT AAATGAAGCG 2340

ATTTTACTTG AAGATACAGT TGGAATTGAT GCTAATTCAT AGTCTGGTTT CGGACGTTTC 2400

TCAACATTTG GGAAACCATT AATTGTTACA TTACCTATAT ATTCTAATGT TTTAGTACCT 2460

CTATCTAGAG ACGGCTGAAT GTCGAAAAGT TCTGGTGTTT CTTCAATAAA TTTAGTTGTG 2520 

TTAACACCAC GAATACGCAT TTCTCGTAAT GAGCGTACCA TTTTTTCTTC TGCTTGTTTA 2640 

AATGATATCG CGTGTGTAGA TAATTTTACG AGTAATGAAT CATAATAA6G TGATATCTCA 2700 

GCACCTTGGA AACCATCTCC AGCATCAAGA CGTACACCAA AGCCCCCGCT TGAACGATAA 2760 

GCAATGATTG TTCCAGTATC CGGCATGAAA TCATTTAACG GATCTTCTGT TGTAATACGA 2820 

CATTGGATGG CATAGCCTAA TGTTGTAATA TCTTTTTGTT GCGGCATATT AATCTCTTCA 2880 

CCAAATAAAT CGGCACCTGC TGCAACTAAA ATTTGTGTCT TAACAATATC AATTCCTGTT 2940 

ACCATCTCTG TAATTGTATG CTCTACTTGT ACACGAGGGT TAACTTCTAT AAAGAAGAAT 3000 

TCGTCACCAG ATACTAGAAA TTCAACAGTA CCTGCATTGA CATATTTAAT ATTTTCCATC 3060

AATTGAATTG CAGCATCACA AATACGTTGA CGTAATGTTG ATGATAATCC AACTGATGGT 3120 

GCAACTTCTA CAACTTTTTG ATGACGACGT TGTACTGAAC AATCACGTTC AAATAAGTGT 3180

ACGATATTTC CATGTTCGTC ACCTATGACT TGTACTTCAA TATGCTTTGG ATTATCAATG 3240

TATCTTTCTA TGTAAACTTC ACTATTACCA AATGATTTTT CAGCTTCTGA TTTTGCTCTA 3300 

TGGAAAGCAT CTTCTAATTC ACTTTCTTCA CGAACGATTC TCATACCTTT ACCGCCGCCA 3360 

CCACTTGTGG CTTTAATCAT TAGCGGGAAA CCAGCTTCTT CTGCAAATTC TTTTGCTAAT 3420 

TCATATGATT TAATTGGACC GTCTGTACCA GGAATAACTG GTAAATCTGC CTTGATAGCC 3480 

GTTGTACGAG CTTTAACTTT ATCTCCAAAC ATATCTAAAT GTTCTAAATG AGGACCAATA 3540

AATTTAATTC CTTCTTCTGC ACAACGACGC GCAAATTGTT CATTTTCACT TAAAAATCCA 3600 

TAGCCAGGAT GAATCGCATC CACATTCGCT TGTTTTGCTA CATCAATGAT ACGCTCAATA 3660 

TTTAAATAAC TTTCAGCAGG ACCTAAATCA CTTCCAACTA AATAGGATTC ATCTGCTTTA 3720

TATCTATGTA ATGAACTTTT GTCTTCATTC GAATAAATTG CAACTGTGCT GATGTCTAAT 3780

TCTGCCGCCG CTCTGAATAT ACGAATTGCA ATTTCTCCAC GGTTAGCAAC AAGTAACTTT 3840

TTTATTTGTT TCAATAGCGA TACACTCCTC AAACTATTAG AATTTTCTAA CTAATTAGAT 3900

AATAAAATTT TATCTTAAAG CGCTCTGTTT TGCTATAGTt TrtTGTTTCmAA TTTTCAAAaT 3960

TTaACATyCT tGAGACAATT AAAaCCyCCG CTTCmGaAAT AATAATTTCA AAAATGACTA 4020

TGCAACAACA GGTAGTTCCA CGTTTTTGTT GTGAAACATT TTCGATTTCT ACAACTCTAA 4080 

AAAATTAAAA ATAAAATTGC AAAACATCAA CATTTATTAT CAATAGCGAT AACTTTATCT 4140 

TATCATCATG ATTCTAATTT CGCCACCACA TTTAGTAATT TTTAGTCATA AAATTTAGTT 4200 

ATAATTATAC GTTGTTTTGT TTATAAAATT TGATAATCak GAGTAATCtC GTAATATCAA 4260 

AACaAAAAGG AAGTTAAGCG TTGTTTGGTT GcCTAACTTC CGTTATTGAA CTCATCcAGT 4320 
TCTCGTACTA AATATTGGCT AOTATTTTTT TAATTAAATT GTCTTCTTAT ATCAACTTTT 4440

TGTTGTTGTT TCTTTCGTTG CTGGTCTACT TTGATTTGTT TACCTACAAT CAGAAGTAAA 4500 

CCCATAGCAA TACTTAAACT AATCATTGAT GATCCACCAA AOCTGATAAA TGGCAATGGC 4560

ACACCAGTTA ATGGAATTGT TGCCGAAATA CCGCCAATGT TTACAAACGT TTGACTTCCA 4620 

AAGTATGTGG CAATCCCAAC ACACACAAGT TTATAAAAAT ATGATGATGT TTTATTTGCA 4680 

AACTGGAAGG CACGATATAC AATAAAGAAC TCTAAAGTAA TAACTAGCAA TCCTCCGATT 4740 

AAACCTAATT CTTCGCAAAT AATTGCAAAA ATAAAATCTG TATGTGGTTC TGGTAAATAG 4800

CCCAATTTCA TTGCACTATT TCCTAATCCT TTTCCAAATA CGCCACCGTT ACCTATCGCA 4860

AGCAATGAAT TGGAAATATG GTATCCAGTT CCTGATTCGA ATTGGAATGG ATCTGTTAGC 4920

GTACTAAATC TGGCAGTTAA ATAACTTGGT AACCAACCAG CCATTAATGC AATGACAAAT 4980

ACTACTAAGA ATCCTAGCAC TGCTGGTATA CCAAATCTTA GGACTTTGTT 5030
(2) INFORMATION FOR SEQ ID NO: 325:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1389 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325:

CTTGTTAATC CGAAAATTAT TAGTCAATCA AATGAAACAA TAACAGACTT nGAAGGTTCA 60 

ATTACATTGC CAGATGTTTA CGGCGAAGTG ACAAGAAGTA AAATGATAGT TGTCGAAAGT 120

TATGACGTCA ATGGGAACAA AGTTGaACTA ACTGCACATG aAGATGTAGC AAGAATGATT 180

TTGCATATTA TAGATCAAAT GAACGGTaTC CCTTTTACAG AACGTGCGGA CCGTATTTTA 240

ACAGATAAAG AAGTGGAGGC ATATTTTATA AATGACTAAA ATAATATTTA TGGGTACACC 300

AGACTTTTCA ACAACTGTTT TAGAAATGCT TATTGCAGAA CATGATGTCA TTGCAGTCGT 360 

AACGCAACCA GATCGACCTG TTGGACGTAA ACGTGTTATG ACACCACCAC CAGTTAAAAA 420 

AGTTGCAATG AAATATGATT TACCTGTATA TCAACCTGAA AAATTAAGTG GATCAGAAGA 480 

ATTAGAACAA TTGCTTCAAT TAGATGTAGA TTTAATTGTA ACTGCTGCTT TTGGACAATT 540 

ATTACCTGAA TCATTGTTGG CATTACCAAA TCTTGGGGCA ATTAATGTAC ATGCATCATT 600 

GTTACCGAAG TATAGAGGTG GTGCACCAAT TCATCAGGCA ATTATCGATG GTGAACAAGA 660 

AACCGGCATA ACAATTATGT ATATGGTTAA AAAATTAGAT GCGGGTAATA TTATTTCGCA 720 
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ATTAGGGGCA GATTTATTAA AAGAAACTTT ACCATCTATT ATAGAGGGCA CAAATGAAAG 840

CGTACCTCAA GATGATACGC AAGCAACATT TGCTTCCAAT ATTCGACGCG AAGATGAGCG 900

AATTAGCTGG AATAAACCAG GAAGACAAGT GTTTAATCAA ATTCGTGGAT TATCACCATG 960

GCCAGTTGCT TATACAACTA TGGATGACAC TAACTTGAAA ATATACGATG CTGAACTCGT 1020

TGAGACTAAT AAGATAAACG AGCCTGGAAC CATTATAGAA ACGACTAAAA AAGCCATTAT 1080

TGTTGCTACA AATGATAATG AAGCTGTTGC AATTAAAGAT ATGCAATTAG CTGGGAAAAA 1140

GAGAATGTTA GCTGCCAATT ATTTAAGTGG TGCGCAAAAC ACACTAGTAG GGAAGAAACT 1200

TATATGATAG AAAACGTGAG AAGTCTTGCT TTTGACACGA TTCAAGATAT ATTAAATGAA 1260

GGTGCGTATA GTAACTTGCG TATCAATGAA GTGTTGTCAG AAAATGAATT AAATGCAATG 1320

GATAAGGCTT TATTTACAGA AATTGTCTAC GGAACCGTTA AAAGAAAATA TACGTTAGAT 1380

TTTTATTTA 1389
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(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2746 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 





TTTGCTAATA ACAATAAAGC CAAAGCCGAT TCACACTCTA AACAGCTAGA AATTAATGTT 60

AAGAGTGACA AAGTACCTCA AAAAGTAAAA GATCTAGCAC AACAACAATT TGCTGGTTAT 120

GCTAAAGCAT TAGATAAACA AAGTAATGCA AAAACTGGTA AATATGAACT TGGCGAaCTT 180

TTaAAATTTA TAAATTTAAT GGTGAAGAAG ATAATAGTTA CTATTATCCA GTTATAAAAG 240

ACGGTAAAAT TGTTTATACT TTAACACTTA GTCCTAAAAA TAAAGATGAT TTAAACAAAT 300

CAAAAGAAGA CATGAATTAC AGTGTTAAAA TTTCAAACTT CATCGCTAAA GATTTAGACC 360

AAATTAAAGA TAAAnATTCA AATATCACTG TTCTTACTGA TGrAAAAGGG kTTTATTTTG 420

aAGAAGATGG CtnAAGTTAGA TTAGTAAAAG CTACGCCTCT ACCTGGTAAT GTAAAAGAnA 480

AAGAAAGTGC TAAAACAGTT TCAGCAAAAT TGAaACAAGA GTTaAAAAAT ACAGTAACAC 540

CTACTAAAGT TGAAGAAAAC GAaGCGATrC AAGAAGATCA AGTTCAATAT GAAAATACAT 600

TAAAAAACTT CAAAATTwGA GaACAACAAT TCGATAACTC ATGGTGTGCA GGATTCAGTA 660

TGGCAGCATT ATTAAATGCa ACTAAAAATa CAGACACTTA TAATGCACAT GATATTATGC 720
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0 786 519 A2

AAATGATTGA ATACGGTAAA TCACAAGGCA GAGATATTCA TTATCAAGAA GGCGTACCAT 840

CATATGAACA AGTTGATCAA CTTACAAAAG ATAATGTAGG AATTATGATC CTTGCACmAA 900

GTGTATCTCA AAACCCTAAT GACCCACATT TAGGACATGC GCTAGCAGTT GTTGGTAATG 960

CTAAAATTAA TGACCAAGAA AAACTTATTT ACTGGAATCC TTGGGATACA GAaTTATCAA 1020

TCCAAGATGC AGATTCAAGC CTATTACATT TATCATTCAA TCGTGATTAT AACTGGTATG 1080

GTTCAATGAT AGGTTACkAA AAAGTAATAT AGATATTGAT TAAAGGCAGG TAAAACTATG 1140

TATCAACTAC AATTTATAAA TTTAGTTTAC GACACAACCA AACTCACACA TCTAGAACAA 1200

ACCAATATCA ATTTATTCAT TGGTAATTGG AGTAATCATC AATTACAAAA ATCAATTTGT 1260

ATACGTCATG GCGATGATAC AAGTCACAAT CAATATCATA TTCTTTTTAT AGATACGGCA 1320

CATCAACGCA TTAAATTTTC ATCTATTGAT AATGAAGAAA TCATTTATAT TCTTGATTAT 1380

GATGATACAC AGCATATCCT CATGCAAACG TCATCCAAAC AAGGTATTGG CACTTCGCGA 1440

CCAATCGTTT ATGAGCGCTT AGTATAACTA ATTTAAATGA TTTCACTTCA TAAAGCGGGT 1500

TGGCGAGAAT TCAATTTCTC ACCAGCTCGT TTTTTCATTG TAATAATAAT CTTTAACATT 1560

TATTCTTTCT CTATTAATTT TTCTCAAACT ATCTTATCTT TATGATAATT AATTAAAATG 1620

CCCTTTTAAA TTCTTATAAA ATAAAAAAGC CACCTATCGT CGCTAATAAA CGACGCAAGT 1680

GACTTAATAT CATATTCAAA ATAACTTATG GGAATTTAGG GAATTGATCG AAGTCAGGAT 1740

CACGTTTTTC TTTAAACGCA TCACGGCCTT CTTTCGCTTC ATCAGTTGTG TAATAAAGCA 1800

ATGTTGCATC CCCAGCCATT TGTTGTAAAC CAGCTAAACC ATCTGTGTCA GCATTCATAG 1860

CTGCTTTAAG GAATCGTAAC GCTGTTGGTG AGTGTTTCAT AATCTCTTTA CACCATTGCA 1920

CAGTTTCATC TTCAACTTTC TCTAAAGGTA CCACTGTATT TACTAGACCC ATATCTAAAG 1980

CTTCTTGTGC ATTGTATTGA CGACATAAGT ACCAAATTTC ACGTGCTTTC TTATGTCCAA 2040

CGATACGTGC TAAATATCCT GAACCATAAC CCGCATCAAA TGAACCTACT TTAGGACCAG 2100

TTTGTCCAAA AATAGCATTA TCAGCAGCAA TCGTTAAGTC ACAAACAACA TTTAGTACAT 2160

TACCGCCACC TACAGCATAA CCTTTTACCA TCGCGATAAC CGGTTTTGGA ATAATACGAA 2220

TTAAACGCTG TAAATCTAAT ACATTTAAGC GAGGGATTTG GTCTTCACCT ACATAACCAC 2280

CATGTCCACG TTTCTTCTGG TCACCACCAG AACAGAATGC TAAATCACCT TCACCAGTTA 2340

ATACGATAAC TGAAACGTTT TGATCATCAC GTGCACGTGA AAATGCGTCA ATCATTTCAG 2400

CAACTGTTTT AGGTGTAAAC GCATTGCGTA CTTCAGGGCG ATTTATTGTT ACCTTAGCAA 2460

TCCCTTCGTA AAATTCATAT TTGATTTCAT CATATTCTCT AAGTGTTTCC CATTGTCTGT 2520
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TCGACATGAA TTGTATGACC TGTAGCAGAA ATTAATTTAC ATTTACTATT AGGAATTAAA 2640 

TTTGCCATTT TTTTCGCAAT CTGTACAAAT TTTTCATCAT ATTCTCCAGC TAATATTAAT 2700 

GTTGGTACTT TAATTTCTnT CAGCGCGGCC ATAAGTTTGG CATTTG 2746

(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 900 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327:
TAATGTTTAG TTTATTAACA GTAAGTTCGT ATATCAATGT TTAGTGCTCC CCAAAATTGA 60

AGTTTGAATT TTAAAAGCAT CTTGTAGAAT TTAGTTGTAT TTTTTTCAAA GAAATTCATT 120

TTGATTATTT TTGATAATGA GCATTTTAAT AGTAATACAT GTTTATAGTG TGTAGTATAT 180

GTCTATACTA GTAGTAACTA TATAGAGAAA GTAGGAATAA ACTATGTCAC AAGATGTAAA 240

TGAATTAAGT AAGCAACCAA CGCCAGATAA AGCAGAAGAT AACGCATTTT TCCCATCACC 300

ATATTCCCTT AGTCAATATA CAGCACCTAA AACAGATTTT GATGGTGTTG AACACAAAGG 360

TGCCTATAAA GATGGTAAAT GGAAAGTATT GATGATTGCT GCTGAAGAGC GATATGTATT 420

ATTGGAAAAT GGAAAAATGT TCTCTACGGG TAATCATCCT GTTGAAATGT TATTACCTTT 480

ACATCATTTA ATGGAAGCAG GTTTTGACGT TGATGTTGCG ACATTATCTG GTTATCCAGT 540

TAAATTAGAA TTATGGGCTA TGCCAACTGA AGACGAGGCA GTTATAAGTA CTTATAATAA 600

ATTGAAAGAA AAATTAAAAC AGCCAAAAAA ATTAGCAGAT GTGATTAAAA ATGAATTAGG 660

ACCTGATTCA GACTATTTAT CTGTCTTTAT CCCAGGCGGA CATGCTGCAG TTGTTGGTAT 720

TTCTGAAAGT GAGGACGTTC AACAAACATT AGATTGGGCA TTAGACAATG ACCGCTTTAT 780

AGTTACATTA TGTCATGGAC CAGCAGCACT ACTTTCAGCA GGGCTTAACA GAGAAAAATC 840

TCCATTAGAA GGATACTCTG TTTGTGTCTt CCCTGaCTCA TTAGATGAAG GTGCAAATAT 900

(2) INFORMATION FOR SEQ ID NO: 328: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3642 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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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ATCTGAnTnG AGATAGTGAT AATGTGTCAC CcAATTTTaA AcCtTTATTT TTaAGcGtTT 60

CaTCaGCTAA CaCtTCaTTa TCtTTAGtCG CTTTATOCCC tTCtATTAAA CtCGGaACtA 120

AAAATGaTGa CtTTTCAACa CCGAACaCTA AAACaTTGtC tTTTTGATGG CCaTTAGACA 180

CAATTTCCCC TGTTTGCTTC AAAGTAGCTT GCTTCTTGTA TTTATTTTCA ATATCTTTCT 240

TGTTAAAAAC AGATTGTTGC ACAGTTTGAT TGGCATCTTT ATTTAGAACA ATGGCATCTG 300

CTTGCCACTT ATCAATGCCT TCTTTATTCA TATTGATAAG ACCATTCGCC AATCCAGATA 360

ATAAAAATAG CAAGTAACTA ATCATCGTTA ACACACCAAT AATTAGTCCA AACTTCAATT 420

TGTTGCGCCG TATTTCATTC CAAGCTAAAA ACATGCATTT CTCTCCCTAC TACTATGATT 480

TAAACATTGT TTATATTCTT AGATGCACGT ACGTCGTGTT GOGCTCTGTA ATGTTATACA 540

TACACTTATC CTTCATTATA CCCGAACTTT TTATATTAAA AGCAAATTTA TGGAAAATGC 600

AATTAATTGT CTATTATTTT TGTACGGTAC ATTTAAAATT AAGGATCAAT TTAAAAACGC 660

CTACATATAC CTTTAAGTAC ATGTAGACGT CCAATTCATA TATTATTTAA CTTCGCCTGT 720

TTTAGGATCG AATTGCTTAA TAGCATTTTT ACGTAATTTA TCTTTTGCTT TrTCACTTGC 780

TTTATAGTTA TTGTTGTAAA TCGTAGCTTC CCAACTACCA TACATTGGGT TAGGGAAAAT 840

GATATATTTG TTACCGAAAT CGTCTTTATG TTTTTCAATT AATGCTTCAC GAGATTCAGC 900

TGTAGCTTCT TTTGGATCTG TAAAGTCTAA TAAATTATCT CCAAATAGCA TGACAAGTTT 960

ATGATCCTTT TGAACCATTT GTCTGCGTGA TTCTTTACTC TTATCATCTT TACCTTTTAG 1020

TAAAATATGA CTCTTCTTAG CTTGAGGGAT ACCTTGTTGT TTTAAGTTCT TTTGTGTTGC 1080

CTTTAAATCT TTTTCTTTAT CTCTATCAGA AATATAGTAG ATATCGACAC CTTTTTTGTC 1140

AGCATATTTC AAGAATTCTT TTGCGCGATA GACAGGTTTA GCTTTAGCAG cTTGTACCCA 1200

TTCATGCCAA CCTTCTGGGA AAGGTTTATT ATGTATTGAT GCATAGCCTT GATATGGAGA 1260

ATTATCTAAA ACTGTTTCAT CTAAATCCAA AGCAATAGCT AACTTATGTT TACCTTTATT 1320

CTTTTTAATC TCTTTATCTA ACTGTGTCTT TGCACTGTTA TAACCTTGTA AATATAATGC 1380

TTTTGCTTCA GCTGAATTTT GATACGAAGC CACTGCCATA ATATTTTGAT TACCAAGATT 1440

CGCCTTTTGT GATGCTGGTA TAGAAGCTTG TTGCGTTTGT TGAACTTCAG CAGAACTTTT 1500

GGCAAACGCT GTAGAATTTG TCGTTTGTGG TGCTGAAACT GTAACCGCTA CCGATAATGA 1560

TGCTATTGCA ATATACTTTG AAATTTTATT CATCTTATCA CCTCATGATT AATATTTAAA 1620

ATACAGTTAA AATTATAAAT GCATTTATTT AATATTGCTA TACTATGAAA AGATATTTAA 1680

TATTATTTCT TGgAAAAGCT AACAAATATG TGAACATTTC ATATAAGCAT GATTTAATGG 1740
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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GTGACTAGGC CTTCCTATCA GACATATTCA CTCATCCACG TATCATTATG TGTACAGTGT 1860

GCTATCTCTT ATTTACCTAT TGGAACAACC ATAAACTCAT CCATAGTTTA CCTTTTATAA 1920

ATAGCAGTCC TCACTCATAC AATTTCTCAT AAAAATCACA ACGCTCCAAC GTATTTCCAA 1980

CTTACTTTCA CCTATTTTAA TTCATAAAAA CGACACTTTA ATTGTCATTA TCCAATAATA 2040

GCAAGACGTT ATTATTGCAA TCTTTTTTAT AAAATAATAG AATCATAGTA TTGTCATTTA 2100

AAGATAAAGT AAGAACGTTT TTATTTTTCA GATTTTTTAA ATTATTATGA ATATCTAGTT 2160

TTAGGAAGGA AATTACATTG AAAAAACAAG TTATTATTTC GGGCCTCATG TTATTTTCAC 2220

TATTTTTTGG AGCCGGAAAT TTAATATTCC CGCCCATGCT TGGCCATACA GCGGGTCAAA 2280

ATATGTGGAT TGGTATGCTA GGCTTTGCCC TTACAGGCAT ATnACTCCCC TTTATTACTG 2340

TTATTGTTGT TGCATTTTAT GATGAAGGTG TTGAAAGTGT AGGCAATCGT ATACATCCAT 2400

GGTTGGGGTT TATTTTTGCT GTCGTGATTT ACATGTCTAT CGGAGCATTT TACGGTATTC 2460

CACGTGCTGC AAATGTCGCG TACGAAATTG GTACAAGACA CATTTTACCT GTGCATAACC 2520

AATGGACTTT AATTATATTC GCAGCAATCT TTTTTGCCAT CGTTTACTGG ATTAGTTTAA 2580

ATCCATCGAA AATCGTTGAT AATTTAGGTA AATTATTAAC ACCGTTATTA CTATTAATGG 2640

TCGCTCTATT AAGTATTGCT GTCATTTTCA ACCCTGAATC TGCACTAAGT GCACCTAAGG 2700

ATAAATATAT AACACATCCT TTCATTTCAG GAAGTTTGGA AGGCTATTTT ACAATGGATC 2760

TTGTTGCTGC GTTAGCTTTT TCCGTAGTCA TTGTCAATGG CTATAAGTTT AAAGGCCTCA 2820

CAGATCGCAT GAAAATTTTA AAATATGTCT GCTTTTCAGG TCTTATTGCA GCCATATTAC 2880

TTGGAATGAT TTACTTTGCA CTTGCATACG TTGGGGCATC AACAGCTCCA GGAAACTTTA 2940

AAGATGGTAC AGATATATTG ACGTACAACT CATTACGATT ATTTGGTTCG TTCGGTAACC 3000

TCGTATTTGG AATGACGGTT ATCCTTGCAT GCCTAACAAC ATGTATAGGA CTCGTcAATG 3060

tTGCGCCACA TTTACTAAGA AACACGTACC TAAGTTTTCT TATAAAATAT TCGCACTTAT 3120

TTTCtCTATC ATAGGGTTCT TATTTACAAC ACTTGGTTTA GAAATGATTT TAAAAATTGC 3180

TGTCCCATTA TTGACTTTAA TATATCCCGT GTCGATTGCA CTTGTACTCA TATCATTTGC 3240

TAACATGTTT AGCACATTCA GATTCAGTTG GGCCTATCGA CTCGCAACTG TTATTACATT 3300

GATTATTTCA ATTTTACAAA TACTAAATAG TTTCAACTTA TTACACGGTG TTATTTTGAA 3360

ATCGTTTATG ATGTTACCTT TAGCAGATAT CGATTTAGCT TGGCTTGTAC CATTCATGCT 3420

CTTTGCTATT ATCGGTTTCA TAATCGATGT ATTTATACGC CGTCCGAAAC AAGCGACAAC 3480

TTAATAAATG CTCACTGCCT AGTAATGATT GACCCATCGT TACTAgGCTT TTttATATGA 3540
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TAAAAAATCC TAGCTGTTAT TCAAAAATAT TAOTTTTTAA AA 3642 
(2) INFORMATION FOR SEQ ID NO: 329: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329:

TTATTGATAT TGAAAATTCA AAAACTGCAA CACATCTTAT CACTAAAAAC CAACACTATC 60

ATCAAACTGA CACTCATTTT GAACAGTATA AGAAATTTAT TTTAGATTCA GGTATATCAT 120

CAACACAATT TGTATATAAT AACCTGTCTG TAAGCGGATT TAAATATACT AATGATGGTA 180

AGAATCCAAT TCAATTATCT GACATAGTGT ATCACTTAAT CGCATTATTA CGATATGGCG 240

GTGGCATTAG CTATCAATTA TTAGATGACC ATTCAAATTA TATTTCCTTG TACAACAAAT 300

ATGGTAGCCC CCTACCATTA ATGCATTTAT ATAAAATGTT TAGACCTTTT GTTAATGAAG 360

ATATTGAAAT TACAAATAAT TATGTATTGA GTCGTAAAGA TAATAACTAC CATTTCTTAT 420

TATTCAATAA AATTAATGAT CGATATATGT CAGACGTAAA ACAAGATTTC ATTTTCCATA 480

ATGAATTACC TCAAGACTCT TTGATGATTA TTAAAACATT GAATCATGAA CATGGTTCAA 540

TTCAACATTT GCTTCCAATA AGCGATCAAC TTGTTTATAT AGAAAAAGAA ATTTTAGATG 600

AATTAGACAA AACGAATTAC CCTAAAACGG AGCTTGCAGT TCAAGAAGAA ACTGGTCGTA 660

CATTTGAACT CAAGTTAAAT CACGACGAGG TTAAATATAT TTGCTTTAAA CCAAGCTAAA 720

TACTAACAGT CCTCTTGTGT TTAGTTTCTT ACGTTAAAGG CTATTTATAT CATAAGGAGA 780 

TGATTTGTAA TGAGTAACTC ACAAGCAATT CAAGCAATTG AAAACGTGTT AGTAACGTCA 840 

AAAGTTGGTG TATTATCAAC TGCATATAAT AATAAnCCTA ATAGTAGATA TATGGTCTTT 900 

TATAATGATG GTCTTACTTT ATATACTAAA ACGAATATCC ATTCTGCTAA GGTCAAAGAA 960 

ATTAAAGATA ATCCAGCAGC ATATGTTTTG TTAGGCTATA ACGACACAAC TAATCGTAGT 1020 

TTCGTTGAGA TGGAAGCGAC AATCGAAATC GTTACAGAAC AAGAAGTGAT TGATTGGCTA 1080 

TGGGAAACAC AAGACAAAAG CTTTTTCAGT TCAAAAGAAG ACCCAGAGTT ATGTGTTTTA 1140 

AGAGTAGTTC CGCAATCCAT TAAGCTAATG AATGATAAAT CATTAGATAC ACCTATCAAA 1200

ATCGATTTAT AACACAAAGT GTATATAGGA AATAACTTTT ATGAATTCTA GATATAACAA 1260 

TGTTAAATAC TTAAAATAAC TCGCTATAAT TAAAGTGTTT AATATGTTTA CAATTCAATT 1320 
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TTTAAATTAA TTTTATGTAA TATAAATACT GCATTTGCAA ACTGTTGCAC TTTTAGOTAT 1440 

AACAGAATTA ACTACATTTA AGGAGATTGA TGAACCATGA AAAAGAAAAA AGGTTTTGGT 1500 

CTTGGTATTA GTTTAATCGC CATCATGTTA ATTGTATGTA TTGTATTAGT AATCATGATG 1560 

ATGACTGGCG GAAAGAAAGA TACATACTAT GGAATTATGA AAGATAATAC TACTATTGAA 1620 

AAAATGATTA GTGAAAAAGA TGAAAGTATT GAAAAAAATG TTAAATTACC TTCAGATTCA 1680

GATGTTAAAG TTAAAAAAGG TGATTTTGTA ATTGTTTATA AATTAGCAGA TTCAGATAAA 1740

ATTGTTAAAG TTAAAAAAGT TGACCATGAC GATGTACCAC ATGGTTTAAT GATGAAAATT 1800

CATGACATGG GCAAAATGCA CATGAAACAC TAATTGTAAT TTAAATTACA AATTTTAGTT 1860

GCCATCAAGG TATATACGAG TAAAAGCAGC GGTAAGTTGA TTTCCAATTT GGAATCATTT 1920

TACTGCTGCT TTTTATATTT GAAATACTTT CATATTGAAT AGCTCCACTT GCCGTTCGCC 1980

TgcgCTTTGC GCATGCATAA AAGCCCCTAA CAACCTGAGG TCACTGCGCT CCGGTTCGCC 2040

TGcgCTTTAG CGCATGCATA AAAGCCCCTA ACAACCTGAG GTCACTACGC TTCGGTTCGC 2100 

CTGCGCTTTA GCGCATGCAT AAAAGTCCCT AACAACCTGA GGTCACTACG CTTCCGTTCG 2160 

CCTGCGCTTT AnCATGGCCA TAAAAGC 2187 
(2) INFORMATION FOR SEQ ID NO: 330:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1788 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330:

CCnCCTTTTT AAACCTGGnG AAATGtmCAG tTTTGATGGt ATTGGGtTCT TTAGTATTAT 60 

GCTTAAGATA GAGTGTAATA CGCTCCTGTT GTTCTTTATA GTATATTGCT TTTTGTTTTT 120

CTTTTTCGTC ATATTTCACT TTTAAATAAA TGACTGATGC AACTATACAT ATACATAAAA 180 

TGACACCAAT AATTATAAAA ACATGTTTTT TGTTCATTAA AGTAACTCCT AAAATGTGGT 240 

GGAAAATGAA AAAATTTTTA TAATCTATAA TTATGAACAT TACAGATTAT AAACCATAAC 300 

ACTAACATCG TCGCCTTCAT TAACTTGATT GTTAAAATCA GCAATTACTG AGAATTGTAC 360 

AAGGAATAAT TGCCTATTAT GCCCTCATGT AATTATTGCC TTACTAACAT TAACAAAATC 420

GATAGCTATT ACATTAAATG CCTATACCCC AGACCTCAAA CACCTTTTTA TACAGGACGC 480 

ACTGTAATTT CATyyACGTT AACGTGCTTT GGTTGTGTTA ATGCATATAA TACTGCTTCT 540 
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GTATCTACCA TTCCTGGAGA AATGCTTGTT ACTTTAACGC CTGTCTTTGC CAACTCTTTT 660 

TCTAATCCTT GAGTAATAGT GTGAACTGCT GCTTTCGTCG CACTATAAAT CGTACTACTT 720 

TTCGTTACTT CAAAGCCAGA AATAGATGcA ATGTTAATAA GATGGCCACT TGATTGTTCT 780

AACATAGTTG GTAATGCAGC CTGTGCCGTA TATAAAGTGC CTTTGATATT CACATCAATC 840

ATACTATCCC ACTCATCTAC TTGATAATCA GTAATCTTAG ACGACAACAT TTGCCCCGCA 900

CTATTGATAA CAATATCCAA ACCACCGAAT GTTTGTTGTG CAATTTTTAT CAATTCATCG 960

ACTTCTTCTT TATTCGTTAC ATCTGTTGGC ACTACCTTCA CACTATCTTG TGACAATTGA 1020

TTCGCTACGT TTTGTAATTT TTCTTTATTT CTACCTGCTA AGACAACTTT TGCCCCTTCT 1080

TCATGTAGTA ATGTTGCAAT TGCTTCTCCA ATACCACTAC CTGCACCTGT AACTACTGCT 1140

ACTTTATCTG TTAATACTGT CATAATGATC GACTCCTTTG ATTCTTTTTA TTTTTTCAGG 1200

GTAAATCATA AATACATATT ACTTTTAAAA AGCGTATCAC AATTCATATA ACGGTCATAA 1260

TAACTCGCTT CATTTTCATA GATAAATTAC ATTACAAGCC ATTCGAAACA TACAATTAAT 1320

CGTTGCTTAT ATTTTTTATT TTTAAAAATG TTGAAAAATC GTCACTTCTT TATTGTAAAA 1380

ACATTATATT AGTAATAAAG TTAATACTGT GnATTTaTCA TTCGATTGAA TGATTAGAGG 1440

GAGGAATAAA ACgTGACATA TCATGAGCGT GTTTTAGCAT TAAGAGCAGA AAGTAAAAGA 1500

ACCGCATTTG ATTTTCGATT CGAAGATTTA TTTAGCAAAG AAGAATGGcT AAGTATGTCT 1560

CTTGCAGAAA GACAAAAAGC TGAAAAAGCA TTTCGACACG AgTTAAAAAT ATGGACGATG 1620

TAAGAATGCC CTTCTCAAGT GTCCATGACG CCCAAGTAAA ATTATATAAT GTTGTATATT 1680

CTTATAACGG CaTTAAACGT AATTTTAAAC AAGTTGAAAA TGGAAGGATT CTAATATCAT 1740

TTCGTTTATA TATnGCAGAC CATGGATAGA ATTTTnTATG GTnAATCC 1788
(2) INFORMATION FOR SEQ ID NO: 331: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1341 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 

TAAGCCAAAT CCATTGTTAC CAGAAGAAGT TCGCTTCATC TTAAAACAAA TGGGTCTTAA 60

AGAAAAGACT ATCGATGTTG AACTCGAAGT TGGCGAGCAA GTTCGTATTA AATCAGGTCC 120 

ATTTGCGAAT CAAGTTGGTG AAGTTCAAGA AATTGAAACA GaTAAGTTTA AGCTAACAGT 180 
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GCTTTAATTA ACAATTAAAG TTATTAAACT AACCAAAAGA TAAAAAAGAG TATTGATTTT 300 

TTAATTAGAA AAGTGTTAAA ATTATGTGGT cGcGCTTTTA GAGCGCCCAT TTCGTcACGA 360 

AATGTTAAGA GTGGGAGGGC AAAACTGAGC CCTGTGACCA CATCACGATA TCAAGGAGGT 420 

GCACATCGTG GCTAAAAAAG TAGATAAAGT TGTTAAATTA CAAATTCCTG CAGGTAAAGC 480 

GAATCCAGCA CCACCAGTTG GTCCAGCATT AGGTCAAGCA GGTGTGAACA TCATGGGATT 540 

CTGTAAAGAG TTCAATGCAC GTACTCAAGA TCAAGGAGGT TTAATTATTC CGGTAGAAAT 600 

CAGTGTTTAT GAAGATCGTT CATTTACATT TATTACaAAA ACTCCACCGG CTCCaGTATT 660 

ACTTAAAAAA GCAGCTGGTA TTGAAAAAGG TTCAGGCGAA CCAAACAAAA CTAAAGTTGC 720

TACAGTAACT AAAGATCAAG TACGCGAAAT TGCTAACAGC AAAATGCAAG ACTTAAACGC 780

TGCTGACGAA GAAGCAGCTA TGCGTATTAT CGAAGGTACT GGACGTAGTA TGGGTATCGT 840

TGTAGAATAA TTTTACGAAT ATTAAATTTG ATTACATGAT TTAAACGATG AAGCAGATAA 900

CAGAGATAAT AATGATGAAT TATAAATATA ATCTGAATGA CTAGATTAAT GATTGATTTA 960

TTCATAAGAT TAATTCTTCT GTTGTCTGCy CTTAACTTGC ATATAGCAAG TAATGTGGGA 1020

GGAAATTCCG CTAAAACGAC TAAAGGAGGA ACTATAAATG GCTAAAAAAG GTAAAAAGTA 1080

TCAAGAAGCA GCTAGTAAAG TTGACCGTAC TCAGCACTAC AGTGTTGAAG AAGCAATTAA 1140

ATTAGCTAAA GAAACAAGCA TTGCTAACTT TGACGCTTCT GTTGAAGTTG CATTCCGTTT 1200

AGGAATTGAT ACACGTAAAA ATGACCAACA AATCCGTGGT GCAGTTGTAT TACCAAACGG 1260

AACTGGTAAA TCACAAAGTG TATTAGTATT CGCTAAAGGT GACAAAATTG CTGAAGCTGA 1320

AGCAGCAGGT GCTGACTATG T 1341

(2) INFORMATION FOR SEQ ID NO: 332: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5136 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 
CTCTAAATCT TCAATAGGTA ACTTCATTTT AATAATTCCC ATTTGAACAA TTGCTTCTTC 60

ATCATTGACA ATTTCTAAAA CTTCACCCTT TTGACCGTAA GAkAATACTT TkACTTCATC 120

ACCAGCAACA ATTTTATCGT ATTTTTGCTT TTGTACATTT TGCTTTATAG ATTTCGCTTC 180

ATAATGATCA TCTAATCGTT TCTTCTTATC AATCAATTCA TGTTCTTTAA CATCTGCACC 240
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CTTAATCTTC TGATTTGCTT TkTCTTTCGC TTCCTCTATT AGAGACTTTT CATAATTTTG 360

GAATTGTTGA TACTGCTTAG ATAAATCATC ATGCACTTGt CCGCkTskTT TACAAGACGG 420

TCCAGkTCTA ACCTCTGTGT CTCTACACGT TTGTAATTAC GCTCTAATGA TTCAATCATT 480

TCATTTATTT CTTTTTCATC AGTACCAATC ATCGTCTTAG CCTTATTAAT AATATTCAAA 540

CTAAGACCTA ACTTTTTAGA AATGTCAAAA GCATTTGAAC GACCCGGCAC ACCCATTAAT 600

AACTTATACG TTGGACTCAA AGTATCTACA TCAAATTCTA CACTCGCATT CATAACGCCT 660

TCTCGATTAT AACTATATGC TTTAAGTTCA GGATAGTGCG TCGTTGCCAT TACTAGAGAA 720

CCAATTTTTC TAACATGATC TAAAATGCTC ATTGCTAATG CAGCACCTTC ACTCGGATCT 780

GTACCTGCAC CTAATTCATC AAATAAAACT AAACTATGTT TGTCTGCATG CTTTAAAATT 840

TCAACTATAT TCGTCATATG AGATGAAAAA GTTGATAATG ATTGTTCTAT TGATTGTTCA 900

TCTCCGATAT CGCAATATAC ATTTTTAAAT ACACTTAACT GACTACCATC AAGTGTGGGA 960

ATCAACAATC CTGATTGAGC CATAAGAATA ATTAAACCTA ATGTTTTTAA TGTTACAGTT 1020

TTACCACCTG TATTCGGTCC TGTAATAATT ACCGTTTCAA TATCTTCCAT AAATTCGATG 1080

GTATTAGCTA CAACAGTCTC AOGATTTAAT AATGGATGGT ATGCTTTAGG TAAATATACA 1140

GTACGGTCCT CTTTAAATAT CGGCTTTGTT CCTTTAATAC TTCTACTATA TCTCGCTTTT 1200

GCGATTAAAA AATCTAACTG ACCCATGACT TGTTCTGCCA CAAGTAGTGC ATCTTTGTCC 1260

GCAGCCACAT AACCAGTTAG TTGCGTTAAA ATGCGTTCTT TTTCAATTGC TTCGTCATGA 1320

CGTAATCGAC TAATTTGATT ATTCATTTCA ACAACTGATG ATGGCTCAAT ATACAATGTT 1380

TGTCCTGAAG CAGATTGATC ATGTACAATC CCATTAAAAT CTTGTCGATA TTCAGCTTTG 1440

ACAGGTATAA CGTTTCTTTC ATTCCTAACT GTTACAATAG CATCTGATAA TTTTTTCTGA 1500

TTTGCTTGGC TTTTAACAAT ACGGTCCAAA TTTTGTCTAA TACGTTGATT CGTGCTAGAA 1560

ATTTTACTTC TAATCCCTTG CAATTCATAA CTCGCATTAT CATATAAATC ATACGTATCG 1620

CATGTTTCAT TTATTTGTTG AAAAAGATCA GTTAACACAG GTAATTGATT CATCTTGTCA 1680

TCTAATATTG GGTATTTAAC ACCTTCATCT TCTTCAACCA ATTGATTATA AAATGTCTTG 1740

AATTGATTTT GTACTTGAAT TAATCTTTTT ATCAAGTTAA GCTCTGATAC ATTTAAAACG 1800

CCGCCAATAT CAGCGCGATG AATGAATGCT GATACTTTAG ATAAGCCACT CAAGCTTGGT 1860

AAACGATGCT TATTATAGAT TTGAGCAATC TCATCCGTTT CTTCCATTTG AAAAACAACC 1920

GTTTCAAAAT TAGTAGCTGG CATCATTTGA TTGACCTTTT CCAAGCCTAA GTCACTAATA 1980

GTTTCATTGG CAACGAGTGA TTTTATTTTT TCAAATTCTA AGACGTCTAA TGTTTTTTGT 2040
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TCACGCGATA ATGCGTTAAT CACTCTATCT TTTGTtACAA ATCCTTTTTG CGCAGTTGtA 2160 

CGCCATAATT CATAAAATCT AAATGATTTG TATGATGCGC ATCAGTGTTA ATAGTTAATT 2220

TCACATTTGG ATATTTACGA ACGATATCAG CGCTCAGATC CAGTCGATGT GGATTGGCAT 2280

TAATTTCTAA TACTGTATTC GTTTCTTCAG CTAATGCCAT TAATTGTTCA ATATTOGGTT 2340

TATAACCATC TCTTCTACCT ATAATACGCC CTGTTGGATG CGCTATATGT CGCACGTATG 2400

GATTGCGACA TGCATTAGCT AATCGTTCCA TAATTTGTTC TTCTGATTGG TTAAAGCTTT 2460

GATGAATAGC TCCAATTACA TAATCAAGTT GTGCTAAAAT TTCATCATCA TAATCCAGCG 2520

AGCCATCAGG TAATATATCC ATTTCTGTAC CTGAATAAAT ATCAATTTCA CTATATTCTT 2580

TATCTAAAGC CTTAATTTCT TCGTTTTGTC TTAAAAGTCT TTCTACTTGT AAGCCATTAG 2640

CAACACGTAA ACTTTGTGAA TGATCAGTAA TTACCATGAA TTTATAACCT TTTGCGATAT 2700

TTGCTTCTAC CATGTCTCGA ATAGAAAACG CACCATCACT ATAOGTTGTA TGCATATGAA 2760

TATCACCATT AATATCATCT ATTGTAATGA TATTACTTAG ATCTTTATCA AATTCGCTAC 2820

CATCTTCTCG CATAGCAGGT GGTATAAAAT TCACATTAAA ATGTTCATAT ATCTTGGCTT 2880

CACTATCATA TTGAATTAAT GTACCATCAG CTTGTTCAAT TCCATATTCA CTTACTTTTT 2940

CATCACGTGC TTTAGCAAGT TGTCGAATTC TTATATTATG TTCTTTTGAC CCAGTAAAAT 3000

GCTGCAATGT ATGATAAAAA GCACTTGGTT CAATTAATCG AAAATCGACA CCAATCGTTT 3060

CATCATCATA CGCTAATTCT AATGAAACTT TTGTGTTCCC CACTGCAACT TCTTTTACTT 3120

TATTGGGAAT ATTTAATAAT TGCTGCTGCA CTGCTTTTGG GTTATCGGTA CTTATTATGA 3180

AATCTAAATC TTTGCTCATT TCTTTAAAAC GACGGAAGCT TCCTGCAGAT GAATATTGAT 3240

CGATATAATT TAATGTATCT ATATAATCAA TGATTTCTTG ATTAAGTCTT CTCATTTGAT 3300

CAATTGGATA TCTATCTTTC TTAGCACCAA GTTGTTTCAC AGCTTCTAAT ATGTTTTGTT 3360

CCGTTTTCTT AGCAAATCCG CTTAATTCAC TAACTTTTCC ATTTTCACAA GCAACTTGAA 3420

GTGACGCTTT ATCAACAATA TTCAACTCTT TATATAGCTT AGCAATTTTC TTGCTTCCAA 3480

GTCCTTGAAT TTTCAAAAGT GGAATAAGAC CTTCCGGAAC TTCTTCCTGT AATTGCTGTA 3540

AATACTGAGA TTCACCGGTC TCACGGTAAT CATTGATTAC TTCTGCAACA CCTTTACCAA 3600

TGCCTTTtAA CTCCGTtACA TCAGATATTT CATCTAATGG TCGTTCATCT AATTCAAGAC 3660

TTTGAGCTGC TTTTCGaTAC GCTGaTATTT TAAAAGTATT TTCCCCTTTT AATTCCATAT 3720

AAGTAGCAAT TTGTTCTAAT AGTTTGATAA CATCTTTTTT TGTCATAATA ACACTCCATA 3780

AAAAGAAGAC CAGGACGTAT CATTAATATA TACCTTTGTC CTGACCTCTT ATGATAATTT 3840
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TAGATATTTT AAGCTGATGT TGTAATGCTT CGTTAGGATA TAATGCCAAT AGATATAACG 3960 

TAAAGTGTAA GACAATTATC GTCATAAACA CACCAACTAT CATTCCCATT GCACGACTGA 4020 

AAATATGAAT GTTTTGATAC GCTATTATTT TATCAAAAGT TACGATAATT AGATATAAAA 4080

TGAACTTACA AAACAATGTA ATCATTAAAA AAGCTACAAT CGCTTCAAAT CGATTTTGTA 4140

GATGATTAAA ATGAAACGCA AAAGTTGTAT TAAATGCTGT TGTTTTAGGA TATGGAATAA 4200

ATACAATTAA TCTTTCTACA ATAGATTTGT AAAATTGACT GGCAATCCAC AATGATACAA 4260

nCGTTGCACT CAAATGTATC ATAGATAACC AAAAACCTCG TCTGAATCCA ACGATGACAA 4320

AATACACAAA GAAAATGATT ATGATAAAAT CAATGACCAT TTATTGCTCA CGCTGCTGCA 4380

ATTTGTGAAT TTGTTGTTTC AAACGTCGAT TTTTTTTCTT CTAGTAGTAC TTTTTCATGC 4440

ATAATATTCA CAGCAGTTAG TATTGCTTTT CTTGAAGTAT CTAAACCTGC TGCTTTATAC 4500

CCTAATTCTT TTATTTTATC ATCAACTAAA TGTGCTACAT ATCGTATGTG CTCTGGGTTA 4560

TCTTCCCCAA CAATTGTAAA AAGCTGATCA TTAATTGATA CATTTACCTT GTTTTTAAAC 4620

TGTGTCATTT ATAATTTCTC CTGATCCTTT TTTTAAAATC TAAATTCACG TTATAAAATA 4680

TGACTGGATA GTTTGTCTGA ATTTGATACT AATATTGTTA TATTGCAATT ATGATAAAAC 4740

AACAACACAA TCTCTATAGA TGACTTAATG TTCTTTTTAT AATGAAATAA TGTAAAGAAT 4800

TTTCTATTCA ATACTTTATC ATGTTTAAAT TGTGTCACTA TAACATTTTC ATAAACATTA 4860

TACATGACCA CTATGTATTT TGTAAGTATC CGCAATTAAT TCTTTACAAC ATACATAAAT 4920

GTTTCTGACG TTATTATCAT TTATGATATG ATTATTTTTG CTAAAGACAA TGAAATTTTA 4980

TGAAAGGATT TACACAAATG GCGAATATCG TTTTTAAATT GTCGGATAAA GACATAACGA 5040

CATTAATGTC ACGCATTTCT TTTGATACTG AGAATTTACC TCAAGGnATG AnAGCACGTG 5100

CAAAGTATCC AAAATACAAC TGTAAATATT TACCAT 5136

(2) INFORMATION FOR SEQ ID NO: 333: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4239 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333:

GGCCAAAATT GCACCTCCAC TTTCCTTTGA ACAAATCTAT TTTAAACGCA CATTCCCATT 60 

ATGACAAATT AAATGTGAGT nACATTTGTT TGTATTTTAA CATGACTACT AACGCAAACT 120 
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



GGTATCTTTC AAAGATAAAA TCTTAATAAT TTCTTAGTAA ACTCTTTTCT CTAGATTTAT 240 

CACAATATTA TATAGACCTA TTTTATTTTG ACGTAAGTTG CTAGTATCTT CAAACAAAAA 300 

ACCTTTATAA AAATTCATAC CTTTATGCTA TCGCTGTAGG CTCATTAACT TGTTACATAT 360 

AATTCTTAAC TATCCTTTGA TGATTGTTTT ATTAGATTGT TTCGTTGATG GATACTTTCA 420 

CGAATTTCTA TAGTTCAATG CTACTAAAAA AACAGCCCAA AACTTTAATT TGTTTTGGAC 480 

TGTTTTATAA TTATGCTTGC GATGGTGTTT TAGTTTCTGA AGTTTGTTCA GCAATGTCAT 540 

ATTTAAACTC TTTACCATCA TGATCTACTG TAACTTTCTT ACCTTCAATT TGATTACCAT 600 

CTAATATTAA TTCACTTAAA TTATCTTCGA TAGTTTTTTG TATCGCTCTA ATTAATGGTC 660

TTGCACCATA TTCTGGATCA TATCCTTCTT CTGCGATTTT GTCTTTCGCT TTATCAGTTA 720 

CAATAATATT TATGTTTTGT TCAGATAATC GATTTGTTAA TTTATTAACC ATCATTGTTA 780 

CAATTTCTTT TAATTCTTCT TTTGTTAGTT TATGGAATAC AATGATATCA TCTACACGGT 840

TTAAAAATTC TGGACGGAAT GAATTTTTTA ATTCTTTTAA CATCGTTTTT CGAATTGTTT 900 

CATAATCTTG TCCATCACTT GAACCACCGA ATCCAGCAAA TCGTTGATCT TGTAATTCTT 960 

GTGCCCCAAC GTTTGATGTC ATTATGATAA TTGTATTTCT GAAATCAACT GTACGTCCTT 1020 

TTGTATCTGT CAAATGTCCA TCATCTAAAA CTTGTAATAG AATATTAAAT ACATCTGGAT 1080

GAGCTTTTTC AATTTCATCA AATAAAATTA CAGAATATGG TTTACGTCTA ACTTTTTCAG 1140 

TTAATTGTCC ACCATCATCA TGACCAACAT ATCCTGGAGG AGCACCAACT AATCGGCTCA 1200 

CTGCGTGTTT TTCCATAAAT TCACTCATGT CTACACGGAT CATCGCATCA TCATCGCCAA 1260 

ACATTGATTC AGCTAAAGCT CTAGCTAATT CAGTTTTACC AACACCAGTT GGTCCAAGGA 1320

AGATAAAGCT ACCAATTGGT CGTTTAGGAT CTTTTAACCC TGCACGGGCA CGTCTAACCG 1380 

CTTTACTGAT TGAATTAACA GCATCTTTTT GCCCAATAAC TCTCTCATGT AATGTATCTT 1440 

CTAGACTAAG AAGTTTTTCA GATTCTGTTT CATTGATTTT AGTTAATGGG ATACCTGTCC 1500

ATCCTGCAAT AACTTCAGCA ATATCTTCTT CTGACAATGA AGTTGaCATG CCATTTTGTG 1560 

CATTCTTCCA TTCATTTTTA GCTTCTTCAT ATTGCTTTTC AAGTTTTGTT TGTTTATCAC 1620 

GCAGgTTAGC AGCATTTTCA AACTCTTGAG CATGTACTGC GGCATCTTTT TCATTTTTAA 1680 

CTTTTTCAAT TTCTTGTTCA ATTTCTTTTA AATTATTAGG TGTCGTATGA CTCTTAAGTC 1740 

TTACTTTAGA ACTTGCTTCA TCAATTAAAT CAATTGCTTT ATCTGGTAAG AAACGATCTG 1800 

AAACGTATCT GTTACTTAAT TTAACAGCTG CTTCAATAGC TTCGTCTGAA ATATTAATAC 1860

GATGGTGTGC TTCGTAACGA TCTCTTAATC CTTTTAAAAT AGCAACTGTA TCTACTACTG 1920 

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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TTTTGCGATA TTCATCTAAT GTAGTAGCAC CAATACATTG TAATTCACCA CGTGCTAATG 2040

CCGGCTTCAA AATATTCGAA GCATCGATAG CACCTTCAGC ACCACCAGCA CCAACTAAAG 2100

TATGCAACTC ATCAATAAAT AGGATGACAT TACCTGCTTG TTGGATTTCT TCCATAACCT 2160

TTTTCAGACG CTCTTCAAAT TCACCACGAT ATTTAGTACC TGCAACTACT GTTCCCATAT 2220

CTAAAGACAT AACACGCTTA TCTTTTAATG TCTCTGGTAC CTCATTATTC ACTAtGGCtT 2280

GCGCTAAACC TTCAGCAATA GCAGTTTTAC CAACACCTGG CTCTCCAATA AGCACAGGAT 2340

TGTTTTTCGT ACGTCTACTT AATACTTCAA TTACACGTGT AATTTCTTTA TCACGTCCTA 2400

TAACAGGATC TAATGTACCG TCTTTGGCAA TGACTGTTAA GTCACGAOCT AAACTATCTA 2460

AAGTTGGAGT ATTATTTGAC TTACTAGCTT GTGCATTTTT ATTACTCATT TCAGGGTTTC 2520

CTAAAGCTTT CACAACTTGT GCACGTGCTT TAGTAATATT TAAATCTAGA TTTGCAAAAA 2580

CTCTTGCTGC AACACCTTCA TTTTCACGAA TCAAGCCTAA TAAAATATGT TCCGTTCCAA 2640

CAAAATTGTG ATGTAATTTT CTAGCTTCAT CCATCGATAA TTCAATGACT TTTTTAGCTC 2700

TAGGTGTATA ATGCAATGTA CCAACATGAT CTTGACCATG TCCGATTAAT TTTTCAACTT 2760

CTTCAATTAC TTTATCTTCA GTGATATTAA AACTTTCTAA TACTTTTGCA GCAATTCCTT 2820

CAGGTTCTTT CATTAACCCC AATAATAGGT GTTCTGTTCC TATATTTGAA TGATTTAAAC 2880

GAATTGCTTC TTCTTGGGCA TGTGCTAATA CGCGCTGTGC ACGCTCAGTT AATCTACCAA 2940

ATAACATAAA TAATGACCTC CTACTTTATA TGTTCTCTTA GTATATCTGC TCGTTTTTCT 3000

TTTACAGATT TGTCATCTTC TTCATCTAAT AAAAATGGTG ACTGTATAGC TACCATCAAT 3060

TCATTAAATT TAAAGTTTTG TAATTCAATG TAATTTAAAT CTATACCAAG TTTAACTcGC 3120

TTAATCTATA AGAAGCCTCT TCCATAGTTA TCATTCTACA GTTTTGTAAA ATACCTAGCG 3180

AGCGAAAAAC ACGGTCTTGT GTTTCTAATT GATTATAAGT GTCTAACTTT TGTCGTATTT 3240

GTTTTTCTTC ATGAATGATT TGATTAACAA CTTCTGTTAA TGTTTCTATG ATTTCTAACT 3300

CAGATTTACC AAGTGTAAGT TGGTTGGATA CTTGATAAGT ATGTCCATAA ACTTGCGAAC 3360

CTTCACCGTA AATACCTCTG ATTGTATATC CAAAACGATT AATGGTTTGA GCAATCCGTG 3420

TCATTCTTTT CATAATAGAT AGACCTGGCA AATGTAGCAT CACGCTTGCT CTCATACCAG 3480

TACCTATATT GGTAGGACAT GTAGTTAAAT AACCAAGTTG TTCATCATAA CTTATATCAA 3540

GGCTTCGATC TAATTCATCA TCAATTGATG AAGCTTGATT ATATAAAGCC TGTAATGTCG 3600

TGTCAGTTCC CATAGCTTGA ATACGAATAT GGTCCTCTTC ATTTATCATG ACACTTAAAG 3660

ATTCATCGTC ATTCACTAAT ACTGCGGCTG CTGGTTGTTT TATTAGTTCA GGACTAATCA 3720
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CAAAGTTTGG CAAGGCATCT TGTACCTCAT TTATAACTCT AAATCCcATC ATTTTCAGTA 3840 

GCATACATTA GTGGATGCAC ATGATTTTCT AAAtTACGCG CTAACCGAAT TCTAGAAOAC 3900 

ATAACAATTG GTGTTTCTTC ATTACTTTTC ATCCATTGGC TGATATTATC ATGAATATTA 3960

TGCGTCATCA TGTTGCACCT CACTCTCAGC TTTTAGTGCT TTAATTTCAT CTCTAACAAT 4020 

GGCTGCTTCC TCAAAATCTT GGATTTCAAT AAGTTTTTTC AAATATTCAT TCTTTTCTTC 4080 

GATTTTTCGC TTTAAAGCTA TCTTTTTATG TGAAGAATGT GGTGTCTTTC CAACGTGCTC 4140 

AAATTGTCCA CCTTGAACTC TGCGGACGAT ATCAATGATG TCATCTTTAA ATGTTGcATA 4200 

ACAATTAGCA CACCCAAATT TACCAACATG TGCAATATC 4239 
(2) INFORMATION FOR SEQ ID NO: 334: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1245 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334:
CAAAACTTTG ATTATGTCGT GTCAGTTATT AATGCATACA GTAATGCATC GTTGTTATTT 60 
TTTATTTATT TACTATTTAT TATGTACCAA ATCAGCCTTt CAGTAAAATA GGTCAAATTA 120 

CTGATTTTCT AAATATAAAA TGCCTCCTAA TAACATACTA TTAGTACATC ATTAAGAGGC 180
TCTTGTGTTA TTTGCATACT AAGCGCTCAA ATTTAAATTT AAGATGAAGA TTCTTGCAAT 240
AATATTTCTA TATTTGTCGA TATATCCAAT GGATCTTCCA TTGGCAAGAA TCGATTTACA 300

ACATTTCCTT GTCGATCGAT TATAAATTTT GTGAAATTCC ATTTTATTGG GGaCCCAAAG 360 
ATTCCTGGTT GTTCaTTCTT TAAATGCGtA AATAACGGAT GTTCATCGTT CCCGTTCACA 420 
GATATTTTAG CTAACACTGG AAATGTTACA CCAAATTTCT CACGACTAAT TTTCAAGATT 480

TCTTCATTAG AACCTGGTTG TCGATTGTCA AAATTATTAT TCGGAAAACT CAACACTACA 540 
AACCCACGAT CCTTATATTT TTGAAATAAA GTCTCTAGTT TTTTCAATTG TTCGCTATAT 600 
ATACATTCTG TTGCAGTATT AACAACTAAA ATCACTTTAC CTCTAAATGC TTCTAATTTA 660 
TAAGTTAAGC CTTTATAATC ACTTACTTCG ATATCATACA CATTTCTATT ATTCATAAGA 720 
CACCCCTACA CAGCCTTTTT TATATTGAAT ATGTTCTTTT TAGAATGTTC TGATAAAATA 780 
AGTGCGCGTT TACACCGTGA ACACACATTA TATAGCGTGA TACATTTTTC GAGCACACGA 840
TAAATAATGT TCGAGTTTAT GTTGTTGCTC AACCTATCCG ATTTACCGTC TTTTTTCACC 900 
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CCGTTATAAC CCCTCATTAT AATCATCCTT ATTTTCTATT TTTAAAAAGA CAATTAGACC 1020 

GCTCTTTAAA CTATAGaTTA ATACTTAAGk TAAACTCATA CATACTGATA CCATACGTTA 1080

GATCTAACAA TTTAAAATTC GTTATAACTA TGGATTAAAG AGCTGCCCAA CTCATATAAT 1140

CCTTAAAAAC TTCACATGTG ATTGTkTATT AAGCCCTCCT TTATCnTATT AAATATCCTT 1200 

ATAACCCTTT TAAAATTAAA CTGACACACT CATACATTGT TACAC 1245 
(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1399 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335:

CTTGTAAAGG TAGTTGTTCA TTTAAATTAA AACAGTATGC TAAGTTnGTG CTTATATTTT 60

GCAACTTCAA TTCGACAGGC TCATCTTGAT ACATGAATGC CTCAATTTCA TCATGTGATA 120

ACTTTTTCAA AATATCAATT AAATGrATAC TAAAAACGAT AAATAACATA TCCCAATTTG 180

AATTCAGCCC TAGCGATTTT AATTTGTTTA AAATCTTATC TTTTTGAAAA ATTCGATTCC 240

TAATGTCTTG TATATCGTTA TCAGTCAAAG TTTCCCAATC TATATGTGAA TGAAGACCTA 300

AATAACACTT ATCCATTAAT AATTCATATA CCGTTAATGC AGAGACATCG AAACAACGTT 360

CTTCACTTAr AAAAACGCCA TCAACATCAA ATAAAATTTT CTTCACAATC CCCACTCCAT 420

TTCTGaAAAT TCAGaTATAA ATCATTCTAC TATTTGACTA AAAAAAGCGC AAACCCTATT 480

GAAGTaGATT TGCGCTTTAG CTGTTnAAAT TTTATAAATG TnTTTCAATT TCATCAGCAA 540

CCTGCTGTAC GTGTGTACCG ACAATAACTT GAGTTGAATG TTTGCCATTA ACAGTAACAC 600

CAACTGCACC GGCG'rrrriA ATCTTCTGTT TATCAATAAT AGATGTGTCT TTTAACTCTA 660

GACGCAACCT TGTTGCACAA TTGGTTAAAT TAACAATATT CTCTTGACCG CCTAAACCTT 720

CTAATATTTG TATAGCATGT TGATGATATT TACTTTGTTT AATATCATTT TCACCAGGAG 780

CAATATTATC TTTTACAACT GTTGGGTCAA CTAATTCATT TTCACCTCTA CCAATCGTAT 840

TCAAGTTAAA TACTTGGATT ACTACACGGA AAATCACATA GTATAAGATG AAAAATACAA 900

CACCTTGAAC AAGCAACATC AATGGATGAT TTGATACTGG ATTAATTAGT GATAACACAT 960

AATCTATCAA ACCTGCACTA AATGAAAATC CAGCTGTCCA ATGGAATGTA GCTGCGATAA 1020

ATAAAGATAA TCCTGTTAAT AACGCATGAA CAACATATAA GATTGGCACA ACAAACATAA 1080
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ACCAACCGTA AACTTGTTTT TTCTGAGTAG TTTTAGCTGT ATGATACATT GcTAACGCAG 1200 

CCGCTGGAAT ACCGAACATC ATGATTGGGA AGAATCCCGC TTGATAGCGT CCTOTAATAC 1260

CTTTTATAGC ATCTTTGCCA CTTTGGAATT TACCAATATC ATTAATACCA ATCGTATCAA 1320 

ACCAGAACAC ACTATTCAGT GCATGATGTA ATCCTGTAGG AATTAATAAT CTATTGGCAA 1380 

CACCATATAT GAAAGCTCC 1399 
(2) INFORMATION FOR SEQ ID NO: 336: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1329 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336:

TATAGTTnTA TTATTTAGCG AAGCATTAAT ACTACCACCG GTTATAACAA ATGTATTTTG 60

CGTATTAAAT TGAATGGTAG GACCAATCAA AGTATATTCA ATCGCTGGAC CATCATTGTT 120

AATTAATGAC TGCGCAACCT TAAAACTAAA TTGATCCATG GCACCTGCGC CTGAAAATCC 180

AATATGTTCA TAACCTATTC TTCCTAGATC TTGTACCGTT GAAAAGAGAC CTGGTTGTAA 240

AATCTTAATT GACATTTTCA ATCACCACCC AGTCATCAAC ATTAAAGTTG CCATCTGATA 300

TATCTCTTTC GATTTGTATA AATTTCTGTT CATCTATTGC ATAAAATTGT ATCCATTCTC 360

CTGCTTCGTA CATTGACATT GGTTCACGCT CGCTGCTAAA TACTTTTAAc GGTGTGCGTC 420

CAATAATTTG CCATCCGCCA GGAGAATCTG ATGGATATAG TCCTGTTTGA TTATTCGCAA 480

TACCTACAGA ACCTGCATGA ATTTTTAACC TTGGCTGATT ACGTCTAGGT GTATGTAGTT 540

GTTCATCAAG TcCGCCTAAG TATGGAAATC CTGGCATAAA TCCTAGCATA TATATTAAAT 600

AAGGTTTACT TGTATGTTTT TCAATAACTT GCTCAACAGT TATTCGATTA TGCTTTGCTA 660

CTTCTTCAAT ATCTGGTCCA TATGTACCAC CATATTGAAC AGGTATTTTA ATAATACGAT 720

TGGTTTGATT CACAGCATGA ACArrrrrrr CATTAAATTT GTTAAGTTCT AAATTTTCAA 780

TTAATTTAGA AGATGTTATA GCTTGTTCAT CAAAATATAT TAGAACTGCT CGATACGAAG 840

GGACAATATC TTGAATTTCT AATATTTCTT TTTCTCGTAT CCACCGTACC ATTGCTGTGA 900

CATTACGATA TGTCTCTTCG GATATTTTAT TTTCAAAATA AATCATAATT GTCTGCTCGT 960

TAATAAATCT TACATCCACT TTAAATCCCC CTTTGTATTG CAATAAACCA GTATTGAATA 1020

CCTTTTCATT GTATCATTGA GAAGCACAAG TTGTTTAATA AGTAATTCAA ATCGCATATA 1080
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TTAATATTGT AACTCTTACA CTAATTTAGG TTCTGCTATC ATTCOOTCTG ATGGAAAATT 1200 

TTTACTTTTC ATCTGTCCGA TTTTTTGATT TTGAATATAA AAAAGCACGA CCGAAGTATC 1260 

ATTAACACAC TTCAATCGCG CAATTAAATA ATCTATTTGA TCATTTATTG GATATTAACA 1320

ATTTTTACG 1329 



(2) INFORMATION FOR SEQ ID NO: 337: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3421 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337:

GAATAGTGCC ATTTGGAAAG ACATAAAGAA TCCAGAAGCA CCTTTAGCGA ATATGCCGTG 60

TTCATATAGA TTTGTGAAAC TGGCATGCCC GAATTGAGTT TTAAATGCAA ATAGAATCAT 120 

GACGAAACCA ACTACTATTA AACCAATAAT TGTCGCTATT TTAATGATAG AGAACCAAAA 180 

TTCTAATTCT CCGAAAAGTC TTGCGCTAAG TAGGTTGAAT GACATTAATA ACAGTACACA 240

AAATAGTGCA CTTATCCAGT TTGGAATTTC TGGGAACCAA AAGCTAACAT ATTTTGCCAC 300 

AGCCGTTACT TCAGCCATAC CTGTAATAAT CCAACAGAAC CAGTATGTCC ATCCGGTAAC 360 


AAATCCTGCA AAAGGCCCAA TATATGTATT GGTTACATCT GCGAAAGATT TAAATTCAGT 420 

ATTCTGTATA ATGATTTCTC CTAAACCTCG CATAAACATA AATAACATAA ATCCtATAAT 480

GATGTATGTT AATAGAATTG AAGGGCCGGT TAATGCAATC GTTTGACCAG CACCTAAGAA 540

TAAGCCTGTA CCAATTGCAC CGCCAATTGC AATTAATTGT ATGTGGCGAT TGCTCAGTTC 600 

CCTTTGTAAT TTTTCAGCCA TAATACATCT CCCTTAAATA TAGATATGTT TATTATGCAC 660 

TTATATTGAG ATATATACAA TTATTTTCGG TAAAAATGTG TAAAATTCCA TGTTAATATA 720

CTTTGGTTTT TATAATCATA TATAATAACC AATTGAAAAT TTAATTCTAT TGTAAAATTC 780

ATGGATTATT CACATCTTGA AAAAGCTTTA ATGGTGCTAT TTGTGGCTAT TCTGTGACAT 840

TTACATAGAT TTACAAAAAA ATTGTTGCAC ATATAATGCC AGTtTTTATA TTTCACAAAC 900

GAAATGCGTT TAcTATAATA TTAGTTGAAA GCCATTTCAT AAAGAAACAG TAAAGGGGAA 960 

ATTtATCaTA GCmGaATTAC AAAGAGGTTT AGAAGGGGTT ATCGCmGCGG AGACTAAAAT 1020

AAGTTCAATT ATTGAAAGTC AATTGACTTA TGCCGGCTAT GATATTGATG ATCTAGCTGA 1080

AAATGCGCAA TTTGAAGAAG TTATTTTCCT ATTATGGAAC TATAGATTGC CAAACGAAGA 1140
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TACACATTTT GAGGAGTATG TTACAGATCA CGTGCATCCA ATGACAGCAT TAOGTACGTC 1260 

ATTATCATAT ATTGCACATT TCGATCCTGA TGCTGAAAAT GAATCAGATG AAAATCGTTA 1320 

TGAAAGAGCA ATGCGTATAC AGGCTAAAGT AGCATCATTA GTTACAGCGT TTGCTCGAGT 1380 

AAGACAAGAT AAAGAACCAC TTAAGCCTAA TCCTGACTTA AGTTATGCGG CAAACTTCCT 1440 

ATATATGTTA CGTGGGGAAT TACCAACAGA TATAGAAGTA GAAGCCTTCA ATAAAGCACT 1500 

TATTTTACAC GCTGATCATG AGTTGAACGC ATCTGCATTT ACGGCACGTT GTGCGGTATC 1560 

ATCATTGTCA GATATGTACT CAGGTATTGT AGCAGCCGTA GtTCTCTGAA AGGGCCATTA 1620 

CATGGTGGTG CAAACGAACA AGTTATGACG ATGTTATCTG aGATTGGGTC AaTTGAAAAT 1680 

GTTGATGCTT ACTTAGATGA AAAATTTGCT AATAAAGrTA AAGTAATGGG CTTCGGTCAT 1740 

CGTGTATATA AAGATGGTGm tCCTAGaGCG AAaTATTTaA GaGAAaTGAG CCGTCAAaTT 1800 

tnCGAAAGACG CTGGTCGTGA AGAATTATTT GAAaTGTCAG TGAAAaTGGA AAAmCGTATG 1860

GCAGAAGAAA AAGGATTAAT TCCTAATGTT GATTTTTATA GTGOGAGTGT TTATCACTGT 1920 

ATGGAAATAC CTCATGACTT ATTCACGCCA ATCTTTGCTG TAAGTCGTTC TGCAGGATGG 1980 

ATTGCTCATA TTTTAGAACA ATATAAAGAT AATAGAATTA TGCGTCCTAG AGCGAAATAT 2040 

ATTGGCGAAA CGAATCGTAA GTATATCCCG CTTGrAGaAA GAAAmTAATC AATACAAATT 2100 

AAAAATGAAG ATGTAAAATT TGGAGGTAAA ATAACTATGA CTGCAGAAAA AATTACTCAA 2160 

GGAACTGAAG GATTAAACGT ACCTAATGAA CCAATTATCC CATTTATTAT CGGTGATGGA 2220 

ATTGGACCGG ATATTTGGAA GGCAGCAAGC CGAGTTATAG ATGGTGCtGT TGAGAAAsCC 2280 

TATAATGGCG AAAAACGCaT TGAATGGAAA GAAGTGCTAG CTGGCCAAAA AGCATTTGAT 2340

ACAACTGGTG AATGGTTACC TCAAGAAACA CTTGATACAA TTAAAGAATA TTTAATTGCT 2400

GTTAAAGGAC CTTTAACAAC ACCAATTGGT GGTGGTATTA GATCATTAAA TGTGGCTTTA 2460 

OGCCAAGAAT TAGATTTATT TACTTGCTTA AGACCGGTAC GTTGGTTTAA AGGAGTACCA 2520

TCACCTGTTA AACGTCCACA AGATGTTGAT ATGGTTATTT TCCGTGAAAA TACTGAAGAC 2580 

ATTTATGCTG GTATTGAATT TAAAGAAGGT ACAACAGAAG TTAAAAAGGT AATTGACTTC 2640 

TTACAAAACG AAATGGGTGC GACAAACATT CGATTCCCAG AAACTTCAGG TATTGGTATT 2700 

AAACCAGTTT CTAAAGAAGG AACTGAGCGA TTAGTTAGAG CAGCTATACA ATATGCTATC 2760

GATAATAACC GTAAATCAGT TACTTTAGTT CATAAAGGTA ATATTATGAA ATTTACAGAA 2820 

GGCTCATTTA AGCAGTGGGG TTACGATTTA GCATTATCTG AATTTGGTGA TCAAGTATTC 2880 

ACTTGGCAAC AATATGACGA AATTGTTGAA AATGAAGGCA GAGATGCTGC TAATGCTGCT 2940 
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TTACAACAAA TTTTAACTCG TCCAGCTGAO CATGATGTTG TAGCAACTAT GAACTTOAAT 3 060 

GGTGACTATA TTTCAGATGC TTTAGCTGCA CAAGTTGGTG GTATTGGTAT TGCGCCAGGT 3120 

GCAAACATTA ATTATGAAAC AGGTCATGCT ATTTTTGAAG CAACACATGG TACAGCTCCA 3180 

AAATATGCAG GTTTAAATAA AGTGAATCCA TCTTCAGTAA TTTTAAGTTC TGTATTAATG 3240 

TTAGAACATT TAGGATGGCA AGAAGCGGCA GATAAGATTA CAGATTCAAT TGAAGATACA 3300

ATTGCTTCAA AAGTTGTTAC TTATGACTTT GCCCGTTTAA TGGaTGGtGC TGAAGAAGTT 3360 

TCTACATCAG CATTTGCAGA TGAATTGATT GnAAATTTAA AATAAGCAGA ATAGAATTAG 3420 

O 3421 
(2) INFORMATION FOR SEQ ID NO: 338:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3173 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338:

CCCTnGATAC CCAAACTGCC CTAATACCAC TGGCTAAACC TAATGGATAG TACCATTTAT 60 

TTTCCAATAA ATAAnCCAAC TGCAATnGCT ATaACTCTAA ATATAATAGA GATAATCmcA 120 

TTAATTGGAT TAAtACGCCA AATACTAGTA ATAATAGGCT AGATAATAAT CCACCTAAAA 180 

AGTACTTTTT AATTCCAAAG AAAGCTAATA TCAATAATGC TGCCGGTGCA GATAATTGAA 240 

AATCTAATCC TGGTATAATG GACGGTATTT TCAAAACTGC CAAAATGGTT AAAATCGCAG 300

CAATGACACT AATTTGAGTA ATATCTTTTG ATGTCATACT AAAACCCCTA TACCGTTTCA 360 

TAAACAACTT GCTTCGGTGT GCTTTCTAAA AATGATATGT AATGATTTAA ATCAATACAA 420 

TCGTCCACAA ATATTATTCT GCCTCCATAT CTCGTATTAA CTGGTTTAAT ATCAAATAAT 480

CGATGGTAAC CAATTTTAGC AGCGGCAAAA TAACCTGTCG TATACGTTAA GTCATCGGAC 540 

ACGCAAAGTT CTCCTTTGAC ATACGGATGC GCATTGATAC AACTAGCAAT TGCTAAGGCA 600

TCAGTCACTC TTTCATTAAG ATCACCTTTA TTATTTATAT CTTCAAACGA AAAATGTGTT 660 

GCCCTAATCC CCCTTTGTCC AAATGAATCT AAACGTTTAC CAGATATAGC AGATAGAATA 720 

ATAGCTCCTG TATAAACCGT TTCATTTTTA ATATATGTCA TCCCTTGATT TAGCGCTTGT 780 

TCAGTGACAC CACATTCTTG TGTTAAATGT TGGAGATTTG CTTTATCATC CTCAATAATT 840

TGTAATGCTT TTATTTGTTG AATCGGTTCC ATGATTTTTT GTATTTTTAT ATTGAGAAAA 900 
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ATTTTGTCAA ACTCACAAAT CGTTTCAGCA CCACTAATAT GAACATCTTG ATTGCTAGAA 1020 

CGCATTTTTA TACTATACAT GACGATCACC TcAATCTTCT TGaTGCAAAA TTTCAAACAA 1080 

CCTATCTATA TCTTGTTCAG TATGAAAATA CGACAATGAT ATTCTTAACA TTGGCTTAGT 1140 

CACAGTtGGA TACCTTAAAT AACTTGTAAA CACATGATGC TTTAATAATG TTTGATGAAT 1200 

GTTCTCAGCC GCTTCTATGT CATCAAACTC AATAAACTTA ATCGGCGAGT TTGaACTATT 1260 

ATAATkAACA TTGAGTGCTT TTAACTTTTG GTTAAAATAT TTACTCAAAC TATTTAATTT 1320 

AGTGCGTCTA TCATCAGCAT TTATTAACTT TTCAATGTTT CTTTTTATAA AATACAAATT 1380

ATAAATTGGC AAACTACTTG AGTAGATGAG TGGTCTACCG TGATTAATTA aCATATCCtT 1440 

CaCATCaTTT GaACTkaAAA TcACACCCCC GTATGCACCA CATGCTTTAG ATAAACTAGA 1500 

AGTGAGTATA TCTACACCTT GATAATTCGA GTAAtTCTCT ATtCCAAAAC TATGTGAAAC 1560

ATCGAGTATC AGTGTTGCGT TAnATTTATG CTTTAATGAG ACTAATTGAC CAATATCCAC 1620

AACGTCGCCA TTCGTTGAAA ATACACTATC AGATATGATT ATTTTTGGTA TATTTTGATT 1680 

AGGGTATTTC TCTAACCTTT TTTCTAAATC AGCAATATCT AAATGCTTAT ATATCACTTT 1740 

TTCTAAACCA CTTAACTTAA TACCGTCAAT AATACTCGCA TGATTTTCTT GATCTGAAAA 1800 

CACGACACAA TTTGTATTTT TGAAAATATT AAATAACGCC AAATTAGCAT CATAACCACT 1860

ATTTAAGATA GTACATGcAC TATATCCGAG CCAACCTGCT AACATTGTTT CAATTTCTTC 1920 

ATAAGCTGTC GAACTTCCAC TAATTAATCT TGAACTTGAT AAGTGATAAC TATACTTCCG 1980 

CATAAATCTT TCGAAATCAT CCTTATCAAA CGCTATTTGA CCTAATCCTA AATAATCATT 2040 

AGATGTATAG TTCGTACATC TCTTATTTTC TACTTCAATA TACTGTCTAT CTATATACCC 2100 


TACCGATTTA AGCGACCGAT ATAACCCTTT CTGTTGTAAT AAATCAATTT GCTCTTGAAA 2160 

CTTCATTCTT GTTTTCCTTA TTTTCACAAG TGTCATAATC AATTTCAAAG CCTAAATCAT 2220 

TAATCATATC GTAGTGTAAT TGGTTCGGTT GCCCACCAGT AATTAGATAA TnCACCGACA 2280

AATATTGAAT TCGCCGCTTT TAATGCTAAT GGCTGTAACG AACGTAAGTT GACCTCTCTT 2340 

CCTCCAGCAA TACGAATTTC TTTCGTAGGA TTGATTAATC GGAATAATGC TACGATTCTT 2400 

AAACATTTCA TTGGTGTTAA ATCATCCATG CTTCCAAACT TTGTGCCTTT GATTGGATGC 2460

AAAAAATTAA TCGGAATACT GTCGGCATCC ATTTCTTTTA AAGCAAATGC CATATCAACA 2520

ATATCTTGAT TAGATTCTCC CATACCACAA ATCACGCCAG AACATGGTGA TATATTATTC 2580

GCTTTCATTA GTTCTATCGT ATCTGTTCTA TCTTTATAAC TATGCGTTGT CACGACGTTA 2640 

TCATGGTAAT TTTCACTTGT ATTAATATTG TGGTTATATC TGTCTACACC AGCTGACTTA 2700 
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TGTTGAGATT TAATCGTTCT TACAGTATTA CTAATATGAT CAACTTCTTT ATCGCTCGGT 2820 

CCTCTACCAC TCATAACAAT ACAATATGTT CCAATATGAT TATCATGTGC CACCTTTGCT 2880 

CCATCGATAA TTTGTTCCTC TGGAATTAAA GCATATCGCT GTTTTTGTTT AATATCTCGT 2940 

GATTGTCCAC AGTACCCACA ATTTTCAGGA CATATACCAC TTTTAGCATT TAAAATCATG 3000 

TTTAATTTTA CTTTTTTACC AAAATAATGT TTTCTTAAAA TGTACGCCTC ATTTAATAAA 3060

TCTAAGGTAT CAATATTAGT ATCCTCATAA ATtTTCAATA CAGTCTCTTT TGTtAATTGT 3120 

tCCCCTTGTA ATATGCGTTT AGCCAAATTC ATATTAACAC TTCCTATCTA AAA 3173 
(2) INFORMATION FOR SEQ ID NO: 339:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1694 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 

CGATTATCCA TTAATACAAC CCTAAGTAAA TGTATAAAAA TTATCTTCCA CAAACTTCAA 60

CAAAAGCCTA AATAAATTAC AGCAATTTAT CAAATATTGC TTACTTTGAT TTTATGAAAT 120

nACTTAATTC TAACACATAC TAAATCATCA TATACTAATT CGAAATCAAA TGCATTTAGA 180 

GATAATCaAA ATGCGGAAAC ATCTCCaATA ATCAATAATC TATTCCCAAT AAATATGAAT 240

GTTCTCAACA ATACATTATT TATATCTCTT TACACTGTCA TCGACAAAAA CTAAATCTTT 300

CACTTTCAAT TTCGAACGTG GTTCTACGAC ATTTGCTGCT ATATCATTTA ATGGGATTAA 360

AACAAATGCA CGTTCATTCA TTCTCGGATG TGGCACCGAC AGTTTTGGTA AATCTATCAT 420

TTCTTCTCCA TACAACAAAA TATCCACATC TAAAGTTCTA GGACCCCATC GTTCCTTTCT 480

AATACGGTGT AAACATTCTT CTGTCkTCAA ACAACATTCC AACAGTTGTA ATACTGTGAG 540

TGTTGTTTsA ATTTCAACAC ACAAATTTAA AAAGTTAGGT TGCTCAGTAT ACCCAACTGG 600 

TGCTGTTTCA TAAATCGGAG AAATAtTAGA TACGTTAATA CCATCATATT CATTCaAAAT 660 

CTTyATAGCA TCGTTTAACt GGCTTTCTCT ATCACCAaTa TTACTACCTA AyCCTAAGTA 720

TGCTTGAATC ATyTATTCTC CCTCACTATT TCGATACCTA CTCCATCATA ATGACCCGGA 780 

ATCGGTGGGT TTTCTTTAGT GATTCTCACT TTCGTTTCCA TTACACGATT ATATTGTGAA 840

TTTATACGAT TTGCAATACG TTCAGCTAGA TGCTCAAGTA AATTAACGGC CTTACCTTCC 900

ATAATTGATT TAACCTCTTC GAACACTTCA CCATAATGAA CTGTATCAAT AACATTATCA 960
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ATTTCATTTT CAGCTGATAA AGCACCATGA TATCCATAAA AGCGCATACC TTTAAGAAAG 1080 

ATTGTGTCTT GCATTTTCAT TCTCCTTTAA AAAATCTATA CCTTTAGCTA ATTTAGCATT 1140

CAACTCGACA TTATGAACGC GTACTGCTCT AACGCCTTTC ATAATACCAT ATGCAGTCGT 1200

AGCTGCAGTT ACTTCATCTC TTTCAACCGG TGTTGTATCA TAACCCATCA TCTCTTTAGT 1260 

GAAACGTTTC CGGCTTGTCG CTAATAAAAC TGGATATTCT GTTGCAACAA GTTCATCCAG 1320 


TCTTGCCATA ACTTCGGCTT CTTCATTTCT AGTTTTAGCG AAACCTATAC CTGGATCTAG 1380 

CCAAATTTTA TTTGAAGGTA TACCAGCTAT TTTAGCTTGA TGTGCTTGTG CTAACAAAGA 1440 

TGTTAACATT TCTTCGACAA CCGGTTCATC ACGATTACCA TTTCCATTAT GCATTAAAAT 1500 


AATTTCCGCG TCATAtTTAG CTACAATTTG GAACATACGA TGATCATACA GACCgCCcAT 1560 

tGATCATTAA TCATATCAAC GGCTAATTTT AAACATGCTT CAGCAACCTC ACTTCGAAAT 1620

GTATCGACTG AAATTTTTTA CATCAnAACC GACAATAGCT TCAnCAACAG TAATACTCTG 1680

TTCCATCTCT TCTG 1694 

(2) INFORMATION FOR SEQ ID NO: 340: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1358 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340:

AGCATTTCTT TTCTATAAAC ATTTAATTGA ACATTATTAA GTACACTATT ACTATAGTCA 60

CTATATTGAA CaCATACCTC ATTTAATTCT AATAGCGGTT CAGATTTGTA CTTATTATCA 120

TTATTTGCAG ATGTTTCATC TATCCATTTT TTCACTTTAA ATTTAACATG TTCACTCATA 180

CAAACGTCAC GTAAATTCGC TAAGTTATCA ATGGATTCGA CATCTACTTC TGCATATTTA 240

AGCGCTGTAC AGTATAATGG TTCACGTATG CCTGCTTCTT TAAGCTTAGA TGATTTTAGC 300

AAATCACTAG GCGTTGTATT AGCGATGATT TTTCCATCTT TAAAAAGAAG AACTCTATCA 360

AACGTATCAT CTAATGATTC TTCTAATCGA TGTTCGACAA TAATCATCGT TGACTTTGTT 420

TCTTCATGAA TATTGTTTAA CAATCTCAGC GTTTCATGTC CTGTCGCAGG ATCTAAATTG 480

GCCAGTGGCT CATCCAATAT TAAAATAGGC GTACGATGGA TTAATATACC ACCTAATGAA 540

ACGCGTTGTT TTTGACCTCC AGATAAATCT TGCGGTCGGT GATTTAAATG TTCTATCATG 600

CCAACTTTTT CAGCCCAATA ACTTACATTT TTCTTCATAT CATCTTGTTC AACACAATTA 660
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TCTTGTAAAA CTGTACCAAC AACATTAGAT CTATCATGTA AACAACTAAC GGTTGCATCT 780 

TGATTATTTA TATATAGTTC CCCAGTTATG TTACCTTTAG TTTTAAATGG AATTAATCCG 840 

TTTATGCAAT TTGCAAAAGT CGATTTACCA CTACCCGAAG CACCAACTAC TAATACTTTT 900 

TCTCCTGGAT AAATATCAAC ATTTATATTC TGTAATGTAG GTGTTGCTTG ACTATGATAT 960 

TGAAAACTAA AGTCTTTGAA CGAGATAATT GGTTCAGTCA TGATATATCA TTACCTTTCT 1020 

ATATTCATTT ACATATCTGA TTCAACAAAA TAACTATTCC TTACGTAAAC TACCTTTTTT 1080 

AATTTGAGAT GAaGCATATG CTTTTAATAA TATTGTCCCA ATAATGCCAA CTGAAATAAT 1140

ATTTAATACT GCAGAGATAA CACCTTGTGT ATAAACCTTG TTAGCTGGTT CGTTATAAAT 1200 

CAAAATATCT AATGTTGGTG CAATAAGTGC CCAGCAAATA ATATTCGCAA TAATTTGACC 1260 

GATATTAAAA TAAACCATCG ATTTCCTAGA AAATCGGCCT GAAGAAAGAT TTAATTTTAG 1320 

TCCAATCCAG CCATATAAAC AGCCTATAAT TCCCGAGC 1358
(2) INFORMATION FOR SEQ ID NO: 341: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 

TAGAAGAATT GGAGAAAATG CTAATTCAAT TGTCAACATT CCATAGTTAT CATGATTTAG 60 

AGTTTCTATT TGTGACACGT GAAGATGAAG TTGAAACATT GAAATGGGCA CGTTGGTTGC 120 

CACATATGAC ATTGAGAnGG CAAAACATTA GAGGATTTGT TTACAATCAA CGAACGCGTG 180 

ACCAAATTTT AACGTCAATT TATAGCATGA TTAAAGAACG TATCCAAGCT GTGCtGaACG 240 

CAGCAGAAGT AATGAGCAAA TTATTTTCAC ACCGCAATTA GTGTTTGTCA TTACAGATAT 300

GTCATTAATT ATTGATCATG TCATTTTAGA ATATGTAAAC CAAGATTTAT CAGAATATGG 360 

TATTTCATTA ATCTTTGTTG AAGATGtGAT TGaAAGTTTG CCAGAGCATG TAGATACCAT 420

TATTGATATC AAGTCTCGTA CTGAAGGCGA ACTGATTACG AAAGAAAAAG AATTAGTTCA 480

ATTGAAATTT ACACCTGAAA ATATTGrTAA CGTCGATAAA GAATATATCG CGCGACGTTT 540 

GGCGAATTTG ATACACGTCG AACATTTGAA AAATGCAATT CCTGATAGTA TTACATTTTT 600 


AGAGATGTAT AACGTGAAAG AAGTAGATCA GCTTGATGTG GTTAATCGAT GGAGACAAAA 660 

CGAAACATAC AAAACGATGG CAGTACCTTT AGGTGTAAGA GGTAAAGATG ATATTTTATC 720 




AGGGAAATCT GAGATTATCC AATCATACAT 
TGAAGTTGCA TTCCTATTGA TTGACTATAA 
TTTAGTCCAT TTAGTTGOTA CGATTACAAA 
AACATCAATC AAAGCCGAAT TGAGAAAACG 
CCATATTAAT CAATACCATA AGTTATTTAA 
TTTATTCATT ATTTCCGATG AGTTTGCCGA 
AGAACTTGTA TCAACGGCAC GTATTGGACG 
ACAAAAACCA TCGGGTGTTG TTGaTGACCA 
ATTAAAAGTA CAAGATAGAC AAGACAGTAA 
CATTACmTTA CCaGgTCGTG CGTATTTACA 
CCAATCTGCA TGGAGTGGTG CAACATATGA 
TAAGAOGATT TACATGATTA ATGACTATGG 
TGGACTTGAA GATGAAGAAA CGAAAGAAAA 
TATCGAATCT ATTACAACAC GATTAGAAAT 
ATTGCCAGAA AATGTATATC AxGAAGATTT 
AGATGATGCA AAAGAAGTGG AATTAACATT
TCAAGGACCG ATGGTATTGC AATTGAAAAA 
AGGATATGGT AGAACAACGT TCTTACACAA 
TCCTGATCAA GCACACATGT ACTTGTTCGA 
AGACATACCA CATGTCGCTG ATTACTTTAC
GATACGTATA TTTAATGATG AAATTGATCG 
CACTAGTATT TCTGAATATC GAAAATTAAC 
TATTGATAAC TTTGACGCAG TAAAAGATTC 
GATTAAAATG ACGCGTGAAG GGCTAGCATT 
AGCTAACGCT ATGAAAACAC CAATGTACAT 
ATATGATAAA TCAGAGGTGT CGAACGTAGT 
TGTGGGTCGA GCATTGTTAA GTAGTGATGA 
TAAACATGAT GAGACCAAAT CATATAATGA 
AGAATTTTAT AAAGGTGAAc ACCAAATGAT 
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TTTATCTTTA GCTATTAATT TTCACCCTCA 
AGGTGGGGGT ATGGCGAACT TATTTAAAGA 
CTTAGATGGC GATGAAGCGA TGCGTGCCTT 
TCAACGTTTA TTCGGAGAGC ATGATGTTAA 
AGAAGGTATT GCGACAGAAC CAATGCCACA 
ATTAAAATCA GAACAACCTG ATTTTATGAA 
TTCGTTAGGT ATTCATTTAA TACTTGCGAC
AATTTGGTCT AACTCTAAAT TTAAGTTGGC 
TGAAATTTTA AAAACACCAG ATGCAGCAGA 
AGTTGGTAaT AATGAmATTT ATGAATTATt 
CATCGAAGGC GATAAATTAG AAGTTGAAGA 
TCAACTTCAA GCAATCAACA AAGACTTGAG 
TCAAACTGAG TTAGAAGCGG TCATAGATCA 
CGAAGAAGTT AAGCGTCCAT GGCTACCACC 
AGTAGAAACa GATTTCAGAA AATTATGGTC 
AGGACTTAAA GACGTACCAG AAGAACAATA 
AGCTGGGCAC ATCGCGTTAA TCGGAAGTCC
CATTATTTTC GATGTTGCAA GACACCATCG
TTTCGGTACC AATGGTTTGA TGCCAGTTAC
AGTAGATCAA GAAGACAAGA TTGCTAAGGC 
TCGTAAGAAG ATTTTAAGTC AGTATCGTGT 
TGGTGAAACA ATTCCGCATG TCTTTATTCT 
ACCTTTCCAA GAAGTTTTTG AAAATATGAT 
AGACATGCAA GTAACCTTAA CTGCTTCAAG 
TAATATGAAA ACGCGTATCG CAATGTTTTT 
AGGACAGCAA AAATTTGCGG TTAAAGATGT 
CAACGTATCA TTCCATATTG GCCAACCATT 
TCAAATTAAT GATGAAGTAT CGGCGATGAC 
ATtCCTATGA TGCCAGATGA AATTAAATAT
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GGATTAGATT ATGAAGGTGT TACACTACAA AAAATTAAAT TAACTOAACC AQCAATGATT 2640 

TCATCAGAAA ATCCGAGAGA AATTOCGCAT ATTQCTGAAA TTATGATGAA AGAAATTGAC 2700

ATATTAAATG AAAAATATGC GATTTGTATC GCAGACTCAA GTGGAGAGTT TAAAGCTTAT 2760 

AGGCATCAAG TGGCTAACTT TGCCGAAGAA AGAGAAGACA TTAAAGCGAT TCATCAACTA 2820 

ATGATTGAAG ACTTAAAGCA AAGAGAAATG GACGGCCCAT TTGAAAAAGA TTCACTTTAT 2880 

ATTATCAATG ATTTTAAAAC ATTTATTGAT TGCACGTATA TTCCGGAAGA TGATGTTAAA 2940

AAGCTTATTA CAAAAGGACC AGAACTTGGC TTGAACATTT TATTTGTCGG CATTCATAAA 3000 

GAATTAAtAG ATGCTTATGA TAAACAGATT GATGTTGCAC GTAAAATGAT TAACCAATTT 3060 

AGTATAGGTA TTCGTATTTC AGACCAACAA TTCTTTAAAT TTAGATTTAT TCAACGAGAA 3120 

CCTGTTATTA AAGAAAATGA AGCATATATG GTCGCAAACC AAGCTTATCA AAAGATTAGA 3180 

TGGTTTAAAT AGCAATGAAT TAAATAGGAG GGAGGTATGT TATGAATTTT AATGATATTG 3240

AAACAATGGT TAAGTCGAAA TTTAAAGATA TTAAAAAGCA TGCTGAAGAG ATTGCGCATG 3300 

AAATTGAAGT TCGTTCTGGA TATTTAAGAA AAGCTGAACA ATATAAGCGA TTAGAATTTA 3360 

ATTTGAGTTT TGCACTAGAT GATATTGAAA GCACAGCAAA GGACGTACAA ACTGCAAAAT 3420

CTAGTGCTAA TAAGGACAGT GTAACTGTTA AGGGAAAGGC GCCCAATACG TTATATATTG 3480

AAAAAAGAAA TTTGATGAAA CAAAAGCTTG AAATGTTGGG TGAAGATATC GATAAAAATA 3540

AAGAATCCCT CCAAAAAGCT AAGGAAATTG CTGGCGAAAA GGCAAGTGAA TATTTTAATA 3600 

AAGCAATGAA TTAATATTGA GGTGAAGATA TGGGTGGATA TAAAGGTATT AAAGCAGATG 3660 

GTGGCAAGGT TGATCAAGCG AAACAATTAG CGGCAAAAAC AGCTAAAGAT ATTGAAGCAT 3720 

GTCAAAAGCA AACGCAACAG CTCGCTGAGT ATATCGAAGG TAGTGATTGG GAAGGACAGT 3780 

TCGCCAATAA GGTGAAAGAT GTGTTACTCA TTATGGCAAA GTTTCAAGAA GAATTAGTAC 3840 

AACCGATGGC TGACCATCAA AAAGCAATTG ATAACTTAAG TCAAAATCTA GCGAAATACG 3900 


ATACATTATC AATTAAGCAA GGGCTTGATA GGGTGAACCC ATGATGAAAG ATGTTAAGCG 3960 

AATAGATTAT TTTTCTTACG AAGAATTAAC AATTTTAGGT GGTAGTAAAT TGCCTCTCGT 4020 

AAATTTTGAA TTGTTTGATC CATCAAATTT TGAAGAAGCT AAAGCTGCTT TAATTGAAAA 4080

GGAATTAGTA ACAGAGAATG ACAAGTTAAC TGATGCAGGT TTTAAAGTGG CTACATTAGT 4140

CAGAGAGTAT ATTAGCGCCA TTGTAAATAT TCGAATTAAT GATATGTATT TTGCACCATT 4200

TAGCTATGAA AAAGATGAAT ATATTTTGTT AAGCCGGTTT AAAAATAATG GGTTTCAAAT 4260

ACGAATTATC AATAAAGACA TTGCATGGTG GTCGATTGTA CAATCATATC CTTTATTGAT 4320
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CTTAAATAAT GAAAGTATCG ATACGATTGG GCGTGTTTTA GAAATTGAAA TATACAATCA 4440 
TCAAGGTGAC CCTCAACAAA GTTTATATAA CATTTATGAA CAAAATGATT TOTTATTCAT 4500 


TCGATACCCA TTAAAAGATA AAGTGCTGAA TGTTCATATT GGTGTCATTA ATACATT 4557 

(2) INFORMATION FOR SEQ ID NO: 342: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3931 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342:
TTGAGTGACT TTATTGAAGC GCGTGTAGAA GAAATATTCT TCOAAGTATT TGATGTTTTA 60 

CAAGATTTAG GATTAACAAA AGTAAATGGT GGGTTTATTG TAACTGGTGG ATCTGCAAAC 120

TTACTTGGCG TAAAAGAATT ATTATCAGAT ATGGTAAGTG AAAAAGTTAG AATTCACACG 180 
CCATCACAAA TGGGAATTAG AAAACCTGAA TTTTCTTCAG CAATTTCTAC AATTTCTAGT 240 

AGTATCGCTT TTGATGAGTT ATTAGATTAT GTTACAATTA ATTATCATGa TAATGAAGAA 300

ACTGAAGAAG ATGTTATTGA TGTGAAAGAC AAAGATAACG AATCTAAATT AGGCGGaTTT 360 
GaTTGGTTTA AACGTAAAAC AAACAAAAAA GATACTCATG aAAATGAAGT AGAGTCAACA 420 


GATGAAGAAA TTTATCAATC AGAAGATAAT CATCAGGAAC ATAAACAGAA TCATGaACAT 480 
GTTCAAGACA AAGATAAAGA TAAAGAAGAA AGTAAATTCA AAAAACTAAT GAAATCTCTA 540 
TTTGAATGAT TATTGGCCAA TAAAACTAGG AGGAAATTTA AATGTTAGAA TTTGAACAAG 600 


GATTTAATCA TTTAGCGACT TTAAAGGTCA TTGGTGTAGG TGGTGGCGGT AACAACGCCG 660 
TAAACCGAAT GATTGACCAC GGAATGAATA ATGTTGAATT TATCGCTATC AACACAGACG 720 

GTCAAGCTTT AAACTTATCT AAAGCTGAAT CTAAAATCCA AATCGGTGAA AAATTAACAC 780

GTGGTTTAGG AGCAGGAGCT AATCCTGAAA TCGGTAAAAA AGCTGCAGAG GAATCTCGTG 840 
AACAAATTGA AGATGCAATC CAAGGTGCAG ACATGGTATT TGTTACTTCT GGTATGGGTG 900 

GCGGAACTGG TACTGGTGCA GCACCAGTCG TTGCTAAAAT TGCAAAAGAA ATGGQCGCAT 960

TAACTGTTGG TGTTGTAACT CGTCCATTTA GTTTTGAAGG ACGTAAACGT CAAACTCAAG 1020 

CTGCTGCTGG AGTAGAAGCT ATGAAAGCTG CAGTAGATAC ATTAATCGTT ATACCAAATG 1080 


ACCGTTTATT AGATATCGTT GACAAATCTA CGCCAATGAT GGAAGCATTT AAAGAAGCTG 1140 

ACAACGTGTT ACGCCAAGGT GTACAAGGTA TCTCAGACTT AATCGCTGTT TCTGGTGAAG 1200 

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GTATTGGTGT TTCTTCTGGT GAAAATAGAG CGGTAGAAGC TGCTAAAAAA GCAATCTCTT 1320 

CTCCATTACT TGAAACATCT ATCGTTGGTG CACAAGGTGT GCTTATGAAT ATTACTGGTG 1380

GCGAGTCATT GTCATTATTT GAAGCACAAG AGGCTGCTGA TATTGTCCAA GATGCTGCAG 1440 

ATGAAGACGT TAATATGATT TTCGGTACAG TTATTAATCC TGAATTACAA GATGAGATTG 1500 

TTGTAACAGT TATTGCAACT GGTTTTGATG ACAAACCAAC ATCACATGGT CGTAAATCTG 1560 

GTAGCACTGG ATTCGGAACA AGCGTAAATA CTTCTAGCAA TGCAACTTCT AAAGATGAAT 1620 

CATTCACTTC AAATTCATCA AATGCACAAG CAACTGATAG TGTAAGTGAA AGAACACATA 1680 

CAACTAAAGA AGATGATATT CCTAGCTTCA TTAGAAATAG AGAAGAAAGA CGTTCAAGAA 1740 

GAACAAGACG TTAATCGGTT AATATATATA CACAAATAAT TCAACACAAA TCATCAGATA 1800 

ACATATCTGA TGATTTTTTT ACTAATTTTT AGaACATGTA GAAGGACATT TAAGTTTTTC 1860

aAAGTTATTA AAAGTGTTTA AGTATCGTGT GAAAATTAAG TCaAAAATTA TTTGCGCAAC 1920

ATTTTAACTT TAAACATAAA TGTTATATTA TATAATTATT AACTTTGTAC AGTTAGACGA 1980 

AGATAATTTA AATGAAATGA TGGTGACGAT CGAGTGAATG ATAATTTTAA AAAGCAACCG 2040 

CATCATTTAA TATATGAAGA GTTATTACAA CAAGGTATTA CTCTAGGTAT TACAACTAGA 2100

GGAGATGGTT TAAGTGACTA TCCTAAAAAT GCTTTTAATA TGGCGAGATA TATTGATGAT 2160 

CGCCGATATA ATATTACTCA ACATCAATTG CAATTAGCTG AAGAAATTGC GTTTGATAGA 2220

AAAAATTGGG TGTTTCCCAT TCAAACACAT GAAAATAAAG TCGCTTGTAT TACAAAGGAT 2280

GATATAGGCA CAAATATAGA CACTTTAACT GATGCGCTTC ATGGTATTGA TGCGATGTAC 2340

ACATATGATA GTAATGTCTT ATTAACGATG TGTTATGCAG ACTGTGTACC AGTATATTTT 2400 

TATAGTACAA AACATCATTT TATTGCATTG GCGCATGCAG GTTGGCGTGG TACCTATACT 2460

GAAATTGTAA AAGAAGTGCT AAAACATGTG AACTTTGATT TGAAAGACTT ACATGTCGTT 2520 

ATTGGACCAT CTACATCATC AAGTTATGAA ATTAATGATG ATATTAAAAA TAAATTTGAA 2580

ACATTGCCAA TTGATAGTGC CAACTATATT GAAACTAGAG GACGAGATCG TCATGGTATT 2640

GATTTGAAAA AAGCCAATGC TGCATTATTA ATTTATTATG GTGTTCCTAA AGAAAATATT 2700

TATACGACAG CGTATGCTAC ATCTGAACAT TTAGAATTAT TTTTCTCTTA TCGATTAGAA 2760

AAAGGTCAAA CAGGACGCAT GTTAGCATTC ATTGGTCAAC AGTAAACAAG GAGGAGATAT 2820 

GTTTGCGTGT GAAAGATAAT TTACAACAAA TCTCAACACA AATTAATGAC AAAAGTGAAA 2880 


AAAATAATTT TTCAACAAAA CCAAACGTGA TTGCAGTTAC AAAATATGTT ACAATAGAGC 2940 

GAGCTAAAGA AGCGTATGAG GCTGGAATAA GACATTTTGG TGAGAATAGA TTGGAAGGCT 3000 
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AATCTCGAAA AGTTAAGGAC GTTATAAACG ACGTAGATTA TTTCCATGCT TTAGATCGAT 3120 

TGAGCTTAGC CAAAGAAATT AACAAACGTG CAGAACATAA AATTAAATGT TTCTTGCAAG 3180 

TGAACGTTTC GGGAGAAGCT TCTAAACATG GTATTGCTTT AGAAGATGTT GATCAGTTTA 3240 

TAGATGATCT TAAAAAATAT GACAAAATCG AAATTGTAGG TTTAATGACG ATGGCACCAT 3300 

TGACAGATGA TGAAGCATAT ATTAGATCGT TATTTAAACA GTTACGTTTG AAAAAAGAAG 3360

AAATACAACG ACTCAATTTA GAATATGCGC CTTGTGATGA ATTATCAATG GGAATGAGTA 3420 

ATGACTATCT TATTGCAGTT GAAGAAGGTG CGACGTTTGT TAQAATTGGG ACTAAACTTG 3480 

TAGGAGAAGA GGAGTGAGCC ACTTGGCTTT AAAAGATTTA TTTAGTGGAT TTTTTGTAAT 3540 

AGATGATGAA GAGGAAGTAG AAGTACCTGA CAAACAACAA CAGGTAAATG AAGCGCCAGC 3600 

AAAAGAGCAG TCACAACAAA CAACAAAACA AAACGCAATC AAATCAGTCC CTCAAAAATC 3660

TGCATCAOA TATACAACAA CGTCAGAAGA AAGGAATAAC CGTATGTCTA ATTATTCAAA 3720

AAATAATTCA CGTAATGTTG TAACTATGAA CAATGCTACA CCAAACAATG CATCACAAGA 3780 

AAGTTCAAAA ATGTGTTTAT TCGAACCACG TGTTTTTTCA GATACACAAG ATATTGCTGA 3840

TGAGCTTAAA AACCGCCGTG CGACACTTGT CAATTTACAA CGTaTTGATA AAGTATCAGC 3900

GAAAAGAATT ATTGATTTTT TAAGCGGTAC T 3931 

(2) INFORMATION FOR SEQ ID NO: 343: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3150 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343: 

AATTGTCGGG GGACTCTTAG GTTTTGTCAT GCAAAOAACA AGATTTTGTT TAACAGGTGG 60 


CTTTCGAGAT ATGTATGTGC AAAAGAATAA TAAGATGTTC TATGCATTAT TAATCGCTAT 120 

TACTATTCAA AGTATAGGAT TATTGATTTT GACGGCAACA GATATTTTAC AAATTCCTGC 180 

ACATAGTTTT CCAATATTGG GAACAATTAT AGGTTCTTTT ATTTTTGGAA TTGGAATAGT 240

ATTGGCTGGA GGATGTGCAA CAGGtACTTG GTATCGCGCT GGTGAAGGGC TAATTGGTAG 300 

TTGGATTGCA TTAGTATTAT ATGCTGTTAC TGCAGCAATC ACTAAAACAG GGATTTTAAA 360 

GCCAGTAATG GATAAAATTA ATCAACCAAC GAATGTAAAT AGTGATATGT CTCAAACAAC 420

TGGCATTCCG TTTTGGGGAT TAGTCGTTAT ATTAACTATA ATCACCATTT TTCTAGTTGT 480 
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AGGTATTAGA TATTACCTTT TCGAAAAACG ATACCATCCA TTTATTGCAG CAATTGTAAT 600 

TGGACTTATC GCACTCTTAG CTTGGCCAAT GAGTGCATCA ACTGGAAGAA ATGACGGTTT 660 

AGGTATAACA ACGCCTTCAG CAAATTTAGT ACACTTTTTG ATTACAGGTG AAACTAAATT 720 

TATTGATTGG GGTGTCTTTT TAGTTCTAGG AATTTTCATT GGTTCATATA TTGCAGCTAG 780 

AGGATCAAGA GAATTTAAAT GGCGATTGCC AGACAAGATT ACAATACGAA ACAGTGCCAT 840 

TGGTGGCATA TGTATGGGAT TTGGTGCGTC AGTTGCTGGT GGTTGTTCTA TCGGTAACGG 900 

TTTGGTTGAA ACGGCAACGA TGACTTGGCA AGGATGGATT GCGCTAgCAT gCGATGATAG 960 

TTGGTGTATG GACAATGAGT CATTTTATCT TTGTTCGTCC AATGAAAAAA GTACACCAAC 1020 

AATCTGCAAA GGTTAAACAG CAAACGCAAA TAGTATAGAA GATTATTATG CAAATGATGT 1080 

TGATCAAATA AAAGTGATTG GAAAAGGAGA AATAATTATG ATACACGAAT TAGGTACAGT 1140 

AGGAATGGTA TGTCCATTTC CGTTAATTGA AGCGCAAAAG AAAATGGCAA CATTGCAATC 1200

TGGAGATGAA TTAAAAATTG ATTTTGATTG CACGCAAGCG ACGGAAGCCA TTCCAAATTG 1260 

GGCTGCAGAA AATGGTTATC CTGTAACAAA CTATGAACAA ATTGATAATG CTTCATGGAC 1320 

AATTACAATT CAAAAAGTTT AACGTTATCA TTTTAACAAT AAAATAGATA TTAGATTCTA 1380

TGGCTACTTC CGCTAATTTA AAAGTGAGTA AGTAGTCTTT TTTTTTTTAG TTCATGAAAT 1440 

CATTTTTATA TAGTGTGGCA CATTTTATTC CAAAAGATGT AATAAAACTT AACGCATTTT 1500 


TGCTTTTTAT AAATTGTCAG ATTATTATGA AAAAAAGGGA GTGGTAAGTA TGAATCTTAA 1560 

CGATACGATA TTTATGTTTT TGTGTACATT ATTAGTTTGG TTAATGACAC CAGGATTAAG 1620

TTTATTTTAT GGTGGGTTAG TTCAATCTAA AAATGCGCTT AATACTGTCA TGCAAAGTAT 1680 


GGCAGCAATT GTGCTTGTTA CATTTGTATG GATAACAGTT GGTTTTACAA TTAGTTTTGG 1740 

GAATGGGAAT TTATGGTTCG GAAATTGGGA ATATACTTTT CTTAATCATG TAGGTTTTGC 1800 

GACTCAAGAA GATATTAGCC CACATATTCC TTTCGCTTTG TTTATGTTAT TTCAAATGAT 1860

GTTTTGTACG ATTGCAATTT CTATTTTATC TGGTTCAATC GCTGAGAAAA TGAAGTTTAT 1920 

TCCTTATTTA TTATTCGTAG TAATATGGAC TGCTCTTGTA TACAGTCCAG TAGCACATTG 1980 

GGTTTGGGGC GGCGGTTGGA TTAACAAACT CGGTGTATTA GATTTCGCTQ GAGGTACGGT 2040

TGTTCATATT ACATCAGGTG TTTCTGGTTT AGTATTAGCT ATTATGATTG GAAAAGGAAA 2100 

CAAACATTCT GAATCAACAC CACATAATCT TATCA1TACG TTGATTUGCG GTATATTCGT 2160

GTGGATTGGT TGGTATGGAT TTAATGTAGG TAGTGCTTTT ACATTTGATA ATATTGCGAT 2220 

GCTTGCATTT ACAAATACTG TCATTTCAGC CAGTGCAGGT GCTATAGGTT GGTTAATTTT 2280
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TGTAACATAT CTTAGTGCAA CAATAATGGC 2400 

CATTAATTAC ATCAAGGTAA AACTAAAATA 2460 

TGGTGTTGGT GGTATTATTG GTGCTGTTTT 2520 

TCCTGACATT GAGAATGGCT TTATTTATAC 2580 

ATTATGTGTA ACAGCAGTTG TAATTTTTAG 2640 

AATTAAATTa ATTACACCAT TATCTGTTAC 2700 

GATTGTTCaC GGTGAACATG CTTACTTTGA 2760 

TCGATATTAG AATATATTTA CATAGAATAT 2820 

TGTTGGGACA TTTTGTTATA CAAAAGTTTT 2880 

AATATTATTT AAAGCGGTTA CACATATGCT 2940 

AGGTGAATAA GTATGAAAAA TATATCTGAT 3000 

ACAGTATCTA GATTTTTAAA TAATGGATCT 3060 

AGAATTATAG CAGAACATGA CTATCAACCG 3120

3150 

(2) INFORMATION FOR SEQ ID NO: 344: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3719 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 

GTTATAGTGA AATTGACTCA TCACATTTCA CAGACCGTGA CAAACGCGTT ATTAGACGTG 60 

ATCATGTTAA AGAAGCACAA AGCTTAGTAG AGAACTATAA AGATACACAA AGTGCTGATG 120

CTAGGATGAA AGCCAAACAA AAAGTTAACA CATTAAGCAA ACCGCATCAA AACTATTTCA 180 

ATAAACAAAT TGATAAGGTT TATAATGGAT TACAACGCTA ATCCAAAGTA AATTATAAGT 240 

TATACATCTC GTTTTTAAAT GACAATTTAT CCCCGTAAAT ATTATAAATA ATCTTTTCAA 300 

ATTCCACATA GATATAGAGA CACTAATAAA CCTCTTTGTC TCGATATGAT AGTCTGCAAC 360 

GATTCATGTT GTAGGCTTTT TAATTTTACA AATAAGGCTA AATATATAAG TTCTGGCACC 420 

TAAAATATAG AAAATACATA AAAGTAAGTA TAGTTATTTT ATTATAATTA TTAAATTTTT 480

ATTAATTAAT TGTAAAAATG TATAATTATA ATTAATTAAC GTTTAATATT AAAATTAACT 540 



ATTAGTTGTC ATTACTCCTG CAGCAGGATA 
TTTAATAGGA GGTATCTGTT GTTATATTGT 

TCATGATGCA TTAGATGCAT TTGGTATTCA
AACAGCAGTT TTCCAAAGTA AAAAAGCCAA
TGGTGACATA CATATTATAC TTGTACAAAT

TATAGTCATG ACGTTTATTA TTGCGAAAGT
GGAACAAGAA ACGAATATAG GATTAGACAA

AGGTGAGCTA AATAGATTCA ATAAACATAT

TCATTGTCCT GACATTTAAC TAAAGGTTGA
ATTTTGAAAT CTTTTTATGA AAGAAGCAGA

AAAATAAGGC TAAGTGTCAC AAATAATGAT

ATTGCCAAAT TGGCAGGCGT TTCAAAAAGT
GTCAGTAAAA AAACAAGTGA AAAATTAACA

AATCAATTTG CTCAAAGTTT AAGAGCGAGA 
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ATCGTTTCAA TATTACTTAT AGGGATGGCT ATCAGTAATG TTTCGAAAGG GCAATACOCA 660 

AAGAGGTTTT TCTATTTCGC TACTAGTTGT TTAGTGTTAa CTTTAGTTGT AGTTTCAAGT 720 

CTAAGTAGCT CAGCAAATGC ATCACAAACA GATAATGGCG TAAATAGAAG TGGTTCTGAA 780

GATCCAACAG TATATAGTGC AcTTnCAACT AAAAAATTAC ATAAAGAACC TGCGACATTA 840

ATTAAAGCGA TTGATGGTGA TACGGTTAAA TTAATGTACA AAGGTCAACC AATGACATTC 900 

AGACTATTAT TGGTTGATAC ACCTGAAACA AAGCATCCTA AAAAAGGTGT AGAGAAATAT 960

GGTCCTGAAG CAAGTGCATT TACGAAAAAA ATGGTAGAAA ATGCAAAGAA AATTGAAGTC 1020

GAGTTTGACA AAGGTCAAAG AACTGATAAA TATGGACGTG GCTTAGCGTA TATTTATGCT 1080

GATGGAAAAA TGGTAAACGA AGCTTTAGTT CGTCAAGGCT TGGCTAAAGT TGCTTATGTT 1140 

TATAAACCTA ACAATACACA TGAACAACTT TTAAGAAAAA GTGAAGCACA AGCAAAAAAA 1200 

GAGAAATTAA ATATTTGGAG CGAAGACAAC GCTGATTCAG GTCAATAATG CTCATTGTAA 1260

AAGTGTCACT GCTGCTAGTG GCACTTTTAT AATTTTTAGA TCACGATATG ATTTATTATC 1320 

AATTCAGAAT TAAAAAAGTA AATAGTATCA AAAGTAAGTG TATTTAATAT TAGAAAATAA 1380 

AAATTTTAAA TTTAGTATTA AAATGGAATG TTACTATATA GTTCAATGTG TATTATCACA 1440

GAAAATAAAA TAATGCTTTA CTTCTATATT TAAAAGTGTA TAATGAAAGT TAAGTAATAA 1500 

AGAGCGTGAA GAAAAATGTG AGTTATTTAT ATAGAATATT CTCCTTTTCA TTTATGAATT 1560 

TGTTACAAAA TATTTAGTGC AAAAGCACGA CGGAGGTATT CAATATGaAT AACGGTAGAG 1620 

TTAAATGGTT TAATGCAGAA AAAGGTTTTG GTTTCATCGa AAGAGAAGAT GGTAGCGACG 1680 

TATTCgTACA CTTCtCAGCA ATCGCTGAAG ATGGATACAA ATCATTAGAA GAAGGCCAAA 1740

AAGTTGAATT CGACATCGTT GAAGGCGACC GTGGCGAGCA AGCTGCAAAC GTAGTTAAAA 1800

TGTAATTTTA ACTTATTCAA ACAGTCCTTA CTATAGGGCT GTTTTTTTAr GCTTTAAATC 1860

GATAACAGTT GGTGTGGTAA AAGCACTAGC CGTTATTTTT TTGTCCAATA AATTTAGTTG 1920

GAGATTTAAC AATATATAAT GGTTCTAAAA TAAATCGAAC TGATGGAAAA GTTTTTTACT 1980

TTTCATCTGT CCGACTTTTG ATTTTGAATA TAAAAAAGCG CCAATACAGA ACTTTAATAA 2040

TGACGAGAAT TAAAGTCTGT ATATGGCGAT AACAAGAAGT AATGTTAAAC ACTCAAAATG 2100

TTTAACAATA ATAGGATACC ACATCGCATA ATATCTTACT ACTTAATTAA TAATTTAACT 2160 

AATCAACTTT TTGTTAATTT TTTATTAAGA CTGATTAATT ATTGAGAATA TTTATTGTTT 2220 

TTAAAATCTC ATAATAATTC AGTAATCTTG TTTTCATTTA AAAGGCGAAA CATTAAAATA 2280

ATTAAATAAA AATATTGCGT TTAATTTACA GCGTCAAATA TACTTATTTC TAATGCTTTG 2340
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ATTTCTGTGA GTATTTGGAA GCTACCATTA GGCAACGGTT TAACAATAGA CAATTOCTTT 2460 

TCCGCTTGTT GTATTAAAAA AGGTTTTGTA GATTGATTAT TAATATGCCA TTCACTCATG 2520 

TATGTTTTTC ACTCCTGCTT TAAAATAGGG TTAGAAAGTT TATAGTTGAG ACATTCATGT 2580

TCAACCAAAA TTTTGTTCGA ATTCAATAAA TGTCTTGTTT AAAATAGAAA TATTGTAAAT 2640 

GTTATCGTCC AAAACTTCAC CAGTTAAGTA TTTGTTTTGA ATTAAAATTT GGCAGTTAGT 2700 

TAAGAAGTCT TGATAATCAC GATCGCAAAA ATAGTTTTCA CGTGCATCTT TAGCATCGCC 2760 

AAAAAAGTTA GCGACTGTTT CTGTTTCTCC TTTATTCGAA CGTTCAATAT ATAATTTGTA 2820 

AAATTTAGCT ATTGTATACT TTTGTTCTTT AGTTAGTTCA TTCAAAATAT TGGGCCTCCT 2880 

GAAATATCAT TTGTAATCTA TACCCAATTT ATTGCAAAAC AAAAACTAAT TTAACTATTT 2940

GATGAAACTG TGTTAATAAg CTTTAACAAG CCTTAGTTTG TATGGATCTA TAAAATTATC 3000

TTTAATTGCA TAGGGTGAAA TAATATGTAG TCCATAACTT TTAACTGATT TTTCACTTAC 3060

ACCAAATTTA TAAGCTTGGT AGATAATTTT AGTACAATAC GTAAATTTTT TGCTGTTCAA 3120 

ATTTAATGTA ACTAGATAAC GATGATTTGT ATTCTCATAG TTTTTCTTAA CCCATTCAGC 3180 

CGCTTTTTTA CCTGCACCAG GATAGCTGCA ACGATAAACT TTCATCCAAT CATTTTTGCC 3240

ACTTGCATAA TTATATTTAA AAGATTCGAA GGATTGTGTA GTTGGTTTGT CGCCAGGCCC 3300

CTCAATTTGT AAAATCGTTT TATCATCAAT CGCGATACTA CAATGACCAA AAAATCscCA 3360

CATGACAGGG CCTTTTGTAA CAATAATATC ACCAGGTTGT AATTGGAATT TGTCATCTTG 3420

AATTTCTGAA TACTTATTAT CTGCAATTGT TTTTGGTGAG TTTATTGGGG ATACGACAAC 3480

GAATAATATA AGTAAAATTA TCGTTCGTTT AATATAGTTC ACTTAAAAGC TCCTTGTTGA 3540

AGAAATATAT GTAAATAGTC TTAAATTAGA ATTGTAATCT TTAATAAGCT TGtAAGACTA 3600 

AAACATATCT TAAATATTAA AGTATGAGAG TGTGAAATGT CTATTAAGAA TnAAAAACAG 3660 

TCTGAAACAT CATTGAGACG TTCCAGACTG GATATAAAAT GAATTTCATT TATAGCACA 3719 

(2) INFORMATION FOR SEQ ID NO: 345: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1676 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345:

TTGCGTTGCC GCACCAAGAT ATTGAATGCC TAGCGATTCC GAGTATGCAA ACTGAACGCA 60 
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CTCTTTAATA CGCGTTATCG CTTTTTGTAA ATCTGCATCA TGATAACAAA TCATAACGCC 180 

ATAGCCACCT GCTGTCGGAA GATCATCTCC CGGCTTAATT ACTAACGGGA ATTCCCAATT 240 

CTTAATCTCG TTTTCGAATT GCTCAATTTT TACAACTTTT CTTTTTGGTA AAAACTTCCC 300 

ATTTGTCCAT TCAGGTATTC TTGCTTTATT ATTTAAAGCA ACAAATAACG TTTTATCTAA 360 

TGCATAATAT TGCTGATTCA AGATTGTTTC ATCATGAATA TATTGAAAAT AAATCTTTTT 420 

ATTTTCCTTA TGTGCCAATT GTTTGATCAA GTTTTCGTAA GATTGCTGAT TGTTAAATGT 480 

ATAAATTGAG TTCGGtACTT CCTTACCAAT AACTTGAAAT AGCTGATGCA ATTTGTCTGT 540

CGCACTAGCT TCGTGAACAA TAACAGGTAA TTGATTTGCT ATTAATAACT CCCTACCAGT 600 

TAAAAAATTA GATTGATGTT CGTCCGGTTT CAACCATGGA TTCGATATAT ACGAAGGTCT 660 

TGACGTATAG ACAACATCTT TGTCATATAA ATCACTTAAC GTTAAGTTCG GCTCATTACC 720 

ATTATTTGTC ATTACTTCCC ATTCCCTTTC AAATGCGCAT GCTCTTCAAT AATGTCTTGA 780

TAAACGTCTT GATTTGTAAT TAACTCTAAC CCCATCAACG CCATTATTTT AGCGCCTTTA 840

ATTAATGCTT CATCACCATG TACACTCGCA GCCGCTTCTC TAAATCTATG CGTATGTCCT 900 

ACTAAATTAC GTGATCCTAT TTTAATATGA GGATGTATTG TTGGCACAaC ATgaCTTACG 960 

TTCCCTGTAT CCGTAGAGCC ATAACCAAAA TCATCATCAA TAACTGCTTC ACCAACTTCT 1020 

TCAGCATATT TAGCAAATAA ATCATCTAAT TTCGGCGTTT TAATGAATTC ATTCACACCG 1080 

TTTTGAATTC GACCAAATTC ATAATCACAA CCAGTCTGTA TCGCAGCTCC ACGTGCGATT 1140

TGATTTACTT TTTCTGTTAA TATATCCAAT TCTTTACGCG TCATTGCTCT AGTATAAAAA 1200

CGAGCATGTG TATAGTCTGG AATAATATTA GCTGCTTTCC CGCCATCTAA AATCACACCA 1260

TGCACACGTT GATCTTTTTT AATATGTTGT CGTAGTTGTG CTACACCATT AAAATAACTA 1320

ATCATAGCGT CTAATGCATT TAACGCTTCA TCTGCATTTT CAGAGGCATG AGCACTTTTT 1380

CCGTAAAATT TAACATCTAA AACATCCACT GCCAAAGTAT CAATCGTTTT ATAAGTTTCA 1440

TTTCCCGGAT GAATCATTAA GGCAATGTCT ATTTGATCAA TCACACCAGC CTTGACATAA 1500 

GAAGCTTTAG CGCTACCATT TTCCCCACCT TCTTCAGCTG GACATCCAAG AACGACTACT 1560 

TTACCACCAA TTTGGTCAAT CACTTGCTTC AAACCAATTG CACCAAGAAC ACTTGCAGTT 1620

CCAATGATAT TATGACCACA AGCATGACCC AATCCTGGCA AAGCATCGTA TTCTGC 1676

(2) INFORMATION FOR SEQ ID NO: 346:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1294 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 

TACAGTAGGA ATCATAAAAC CTAATACAAC AAATACAAAA CCATTTAAGG CATAACTAAA 60

TCJTUTTCCAA ATTTGATGGT AATTCATTTG TAGTTCCGTT TGTGCTCTAA TTAAACGGTC 120

GCGTTCTAAA CCATGGATTA GACCTGCGAT TACAACTGCA ATGATACCTG AAGCATGAAC 180

TTCTTCTGCT AAAAAGTATA CGACAAAAGG AGTTAATAAT TGAATAAAAG TTAAGGTATT 240

GTTATCTTTT AAACCTTTAT TAGCGGTTAA GTCTATACGT ATTCTAACGA CAACGAATCC 300

AATAATTGCA CCAATAAGTA CACCTAGTAT TGTTGAAATG ATAAATTGTT CAACAGCTTG 360

GAATAATGAA AAGGTACCAG TTACTAATGC AGTAACAGCA ATTTTAAATG AAATGATACC 420

TQCTGCATCA TTGAGTAAAG ATTCACCTTC TAAAATCGTC ATAGAACCTT TAGGTAATAA 480

TTTTCCGOGT GTAATAGCAG ATACTGCTAC TGCATCAGTA GGACATAAAA TTGCTGCTAT 540

TGCAAAAGCG GCTGGCATTG GTAAGGCAGG CCAAATCCAA TGTATAAAAT AGCCAACACC 600

GACTACAGTT GCAAACACTA GTGCCATTGA CATTAATAGT ATAGGTTTAC GATATTCTAA 660

TAATTTTGTT CGAGAGACGT GGGTACCTTC CACAAAAAGT AGTGGCGCGA TAACGGCAAA 720

CATAAATACT TCAGAATTGA ATTGGAAATC AACTTGTATT GGAATAATGA AAATAACGAC 780

ACCTAATGCA ATTTGAATAA AGGCAGTAGG AATTTGTGGG AATCGATTAT TGATAACCGA 840

ACTAATAATC ACAGCAAAAA TAAAAATTAA AAATGCTTCT AATAGTGCCA TACAATACTC 900

CTCAAAATTT TAATAGTTAA TATTTTATCA CTTTTAAGGC ATAATGACAT AGATATATTG 960

ATAAAATGAA GTTATTTTCA AAAAAACTCT AGTATCGGTT GAACTGATAC TAGAGCGAGA 1020

TGTTTAAATT ATTGATTGTC ATATCTGAAA TGACCGCTGT CATTTTGTCG TTGTTCATAC 1080

GCGAGCTTTT CAGCATTCGT TTTGTATTTT TtATAAAAGa AAAATAaAAA TATnAACCaG 1140

AATGGCGAAA TATAAATAGC TGCTCTTGtT TCGTCACTAA AGAATAATAA AATGAATACA 1200

AAGAAGAAGA ACGCTAGAAT AATGTAAGCA ATAGGCTTAC CACCAATCAA CTTAAATTTA 1260

CTOiTOAT GTGCCTCAGG ATGCTTTTTC AAAT 1294



(2) INFORMATION FOR SEQ ID NO: 347: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1935 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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ACATGATAAT 


GATGACGCTA TTAAAACACG TTTTTTATTT TTCATTGTTA TAACCTTCTT 60

TCGTATGATT GATATTTGTT GATATGTATC GACATGTGAA TAATATCACA AAAACAGAGA 120

ATATATATTT AACTATTTAT TAAATGATTT TGTTAATATT ATTAAATACT TTATCCTCTT 180

TAAAAATAAT GTGTGTACAA AGTCATTAAT TTAGCAAATA TTTTTATTTA GTAGTTAATA 240

ACCATCGATT TGAAATTTAT ATATAATTAT TAGCTAAATA ATATCCTGCA TCTTTCTCAT 300

ACAATTTACT ATAAAtTAgC ATATCCGATA TCAGCGTTAA TAAGATCGTT GATACTAGmC 360

AGTTAATTTC ATAGAACGAA ATCAAATAAC AGACTACTTT CTGCATTTTA AATTATGTTT 420

AAGAATCAnA ATTATGTTTA nATAAATATA TATACTACTT TGAAAGGTGT GAGCTTAATG 480

ACAACTTTTA GTGAAAAAGA AAAAATTCAA TTACTAGCAG ATATTGTTGA ACTACAAACT 540

GAAAATAATA ATGAAATAGA CGTTTGTAAT TATTTAACAG ATTTATTCGA CAAGTACGAT 600

ATTAAATCTG AAATTTTGAA AGTTAATGAA CACCGCGCCA ATATCGTTGC AGAAATCGGT 660

AACGGCTCAC CTATACTCGC ATTGAGTGGT CATATGGATG TTGTTGATGC AGGAAATCAA 720

GATAATTGGT CATATCCCCC TTTTCAACTG ACAGAAAAAG ATGGCAAATT ATATGGCCGA 780

GGCACTACAG ATATGAAAGG CGGTTTAATG GCTTTGGTCG TATCTCTAAT CGAATTAAAA 840

GAACAAAATG AATTGCCTCA TGGAACGATT AGATTACTGG CTACTGCTGG CGAAGAGAAA 900

GAACAAGAAG GTGCCAAATT ATTAGCTGAT AAAGGCTATT TAGAOGATGT CGATGGCTTA 960

ATTATTGCTG AACCAACTGG ATCTGGAATT TATTATGCAC ATAAGGGGTC TATGTCATGT 1020

AAAGTAACTG CAACTGGTAA AGCTGTCCAT AGCTCAGTTC CATTTATTGG TGACAATGCA 1080

ATTGATACAC TGCTTGAATT TTATAATCTA TTTAAAGAAA AATATTCAGA GCTTAAACAA 1140

CAAGATACTA AACATGAATT AGATGTTGCG CCTATGTTCA AATCATTGAT TGGAAAAGAA 1200

ATTTCTGAAG AGGATGCAAA TTATGCATCT GGTCTTACAG CTGTATGTTC GATTATAAAT 1260

GGCGGcAAAC AATTTAACTC TCTACCAGAT GAAGCTTCAC TTGAATTTAA CGTAAGACCA 1320

GTTCCTGAGT ATGATAACGA CTTTATAGAA TCGTTTTTCC AAAATATCAT TAATGATGTG 1380

GATAGCAATA AGCTTTCACT CGATATTCCA AGCAATCACC GACCTGTAAC AAGCGATAAA 1440

AATAGCAAAT TAATTACTAC GATTAAAGAT GTAGCTTCTA GTTATGTAGA ACAAGACGAA 1500

ATATTTGTTT CAGCGCTTGT AGGCGCAACA GATGCCTCTA GTTTCTTAGG AGATAATAAG 1560

GACAATGTTG ATTTAGCCAT TTTTGGACCA GGTAATCCAT TAATGGCACA TCAAATCGAT 1620

GAATATATTG AAAAAGATAT GTATCTGAAA TATATTGATA TTTTTAAAGA GGCTTCCATT 1680

CAATATTTAA AAGAAAAATA AGAACGATGC TGTCAGCTGC CCTATTCGCG TGCTGGCAGT 1740
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TCAAATATCA ACAAGCACAT TTTCATTGAT TAAGTGATGT AAAACTGAAA TTATTGTGCT 1860 

GATTTGTCAT ACATATATTG ACTAATGGGC ATATAAAAAG ATAGCCTCTA ATAGTnACAT 1920 

AAACTCGTAA AAnCC 1935 



(2) INFORMATION FOR SEQ ID NO: 348: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1351 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 

CCTTTnCCTA AACAATTTTT AGATTTAGAC AACAAACCGA TTTTAATCCA TACATTAGAA 60 

AnATTTATTT TAATTAATGA TTTTGAAAAA ATTATTATCG COACGCCACn ACAATGGATG 120

ACGCATACGA AAGATACACT TAGAAAATTC AAAATTTCTG ATGAAAGAAT TGAAGTCATT 180

CAAGGTGGTA GCGATCGTAA CGATACAATT ATGAATATCG TTAAACATAT TGAATCAACA 240

AATGGTATTA ACGATGACGA TGTCATTGTG ACACATGATG CAGTTAGACC ATTTTTAACG 300

CATCGTATTA TTAAAGAAAA TATTCAAGCT GCTTTAGAGT ACGGTGCAGT AGATACAGTG 360 

ATTGATGCTA TAGATACGAT TGTTACATCT AAAGATAATC AAACGATTGA TGCAATTCCA 420 

GtGCGTAATG AAATGTACCA AGGTCAAACA CCTCAATCGT TTAATATTaA TTTATTAAAA 480

GAaAGCTATG CACAGTTGAG TGATGAGCAA AAGAGTATTT TATCTGATGC TTGTAAGATT 540

ATTGTAGAAA CAAACAAACC GGTTCGACTT GTAAAAGGTG AGTTATATAA CATTAAAGTA 600

ACAACACCTT ACGATTTAAA AGTAGCGAAT GCTATTATTC GAGGTGGTAT TGCCGATGAT 660 

TAATCAAGTA TATCAATTAG TTGCACCTAG ACAATTTGAA GTTACGTATA ACAACGTAGA 720 

TATTTACAGT GACTATGTCA TTGTACGTCC TTTATATATG TCAATTTGTG CTGCCGATCA 780

AAGATATTAT ACTGGTAGCC GTGATGAGAA TGTCTTATCT CAGAAATTGC CAATGTCTTT 840 

AATTCATGAA GGTGTTGGTG AGGTCGTATT TGACAGTAAA GGTGTGTTTA ATAAAGGTAC 900 

AAAAGTAGTT ATGGTACCGA ATACGCCGAC AGAAAAAGAC GATGTCATTG CTGAAAACTA 960 

TTTAAAATCG AGCTACTTCA GATCAAGTGG ACATGATGGG TTTATGCAAG ATTTTGTGTT 1020 

GCTAAATCAT GATAGAGCTG TACCACTACC TGATGATATT GATTTAAGTA TTATTTCATA 1080

TACAGAGCTT GTAACAGTAA GTTTGCATGC TATTCGTCGT TTTGAAAAGA AATCTATTTC 1140

AAATAaAAAT ACATTTGGTA TTTGGGGTGA TGGTAACTTA ggTTACATTA CAGCCATTTT 1200
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GAGTCACTTC TCATTTGTTG ATGATGTCTT CTTTATTAAT AAAATACCTG AAGGCTTAAC 1320 
ATTTGATCAT GCATTTGAGT GTGTGGGTGG T 1351 
(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349:

TCATCAAGTC TACGATAAAT TAAGTCCATA TCTAAAGGCT CGGGGTCGAC AGTTTGTAAA 60 

GTATAACCAA CTGCACAGTG GCTACAACGC ATATTACAAA GATTTGTAGT TGTAAATTCG 120 

ATGTTACTTA AAGTTAATTG GCCATGTTCT TTAACATCGT TATATGCTTC CCATGGGTCG 180 

TTTTGAATAC TTATTTTAGG CTTGTTATTA CGCATTTTAT AAACTCCTTA ATTGTTATTT 240 

GATACCAATT TGATACCGTT TAATCAAATA TGCTCATAGC TTGATGTTTT TTATCAGTAT 300 

ATAAATGAGA GTACGTTTGA ATTGTTTCTG TAATGTTAGA ATGCCTCATT AATTCCATTA 360

ATAAATACAT ATCTACACCA TTATTAATTA AaTAGCTAGC GTACGAGTGT G 411

(2) INFORMATION FOR SEQ ID NO: 350:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1639 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350:



TCATTTTCAT AGGTTATTAC GCAGATCAGC ATAATAATCC ATTCCATATG AGTcCTTATT 60

TTGGTTATGC AGCACGTCTA TTGGCAACAA GTGGCATTGA CTATACGTAT GTAAGAATGG 120

CAATGTACAT GGATCCACTT AAACCATATT TACCAGAATT GATGnATATG CATAAACTGA 180

TTTATCCnGC TGGCGATGGT CGTATTAATT ATATTACTAG AAATGATATT GCTAGAGGTG 240

TCATTGCTAT TATTAAAAAT CCAGATACTT GGGGCAAACG CTACTTATTA TCAGGCTACA 300

GTTATGATAT GAAAGAACTT GCTGCAATTT TATCTGAGGC ATCaGGCACA GAAATTAAAT 360

ATGAGCCCGT TTCATTAGAG ACATTTGCAG AAATGTATGA TGAACCTAAA GGCTTTGGTG 420

CATTATTGGC ATCAATGTAC GACGCAGGAG CAAGAGGACT ATTAGACCAA GAaTCCAATk 480






EP0 786 519 A2 

TTAATAATAA AGGAGCGTTA TAGTGAATAT CATCTCAACA ATTtTAATCA TATTTGTGGC 600 

ATTAGAGTTT TTCTATATTA TGTACCTTGA AACGATTGCT ACAACTTCCA AAAAGACTAG 660 

CGAGACATTT AATATAAGCG TCGATAAATT GAAAGACAAA AATATTAACC TACTTTTGAA 720 

GAACCAAGGC GTATATAACG GTTTAATCGG AGTTTTGCTA ATATACGGTT TOTTTATCAG 780 

CAGTAATCCA AAAGAAATAT GCGCAGCTAT TTTAGTGTAT ATCATTGGCG TTGCTATTTA 840 

TGGTGGCCTT TCAAGCAATA TTAGTATCTT TTTCAAACAA GGCACATTQC CAGTATTGGC 900

ACTCATATCA ATGCTTTGGT AAGTATTGGT GTTTGGGGGG GTGGAGATGT AGTCGGAGGT 960

TTGGAGGATT TGAGCGAATT GTGTGTGGAC TTTAGACTCA GAGTATTTCA TCCTAATTAT 1020

TTCAAGCAGA GGTGACAGTA GCGTTGCCTC TGTTTCCTTA TAAAAAAATT ATTTAATGAA 1080

GAAAACCCAT ATCTGATTTA ATTTTCAGCT GATAAATACT CCATATATTA GAATGGCTAC 1140

TTTATCTATT GCATCAATCC TTTAAAACAA AAAACCCATG ATTTCGAAAT TCCCGTATGA 1200 

TOGGGTTCCT ACTCTCATGG ATCAGTTAAA TAAATATTAT CACTATCAGT TTATTATTTC 1260 

AATATTATTA ACAATATATG TAGTCGTAAA AGGAAAGAGG ACATGAGAAC TTCGGTGTTG 1320 

ATTGGCATTA CATAACGCTT CCAAACATAT TATTTGGTAA CAATAAGAAA CTATTTACAC 1380

AATATATTTT GTATAGTAAA ATTATTTTAT AATATTTAAA TCCAATTGCA CAAGGAGTGA 1440

TTATCaTGGT ACCAGAAGAA AAAGGTTCTA TTACTTTGTC AAAAGAAGCA GCTATCATAT 1500

TTGCAATCGC AAAATTCAAA CCATTTAAGA ACAGAATTAA AAATAACCCA CAAAAAACAA 1560

ATCCATTTCT TAAATTACAT GAAAACaAAA AATCTTAATC ACTTTtATTT ATAGcATTTC 1620

TAATCTCAGA AATGCTATA 1639
(2) INFORMATION FOR SEQ ID NO: 351: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1816 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351:

AAAATCGCAT ATAGTAATAT GAATAACCAG ATTGTATCTA CAAAAAAGTA TATTGAAAAA 60 

CCAAGCGCAC CCATTAATAA TGCGAGAATA ATAATAATTT TTCTATTAAA GTGATGCGTA 120

TCCGAAAATC TAGCAATAAT TGAATTTACT GTAAACTGGC TAATCGCTGC AGATGCTAGA 180 

AGTAATCCAT ACTGATTTGT TGTCATACCT AAATCTTTAG TTGCAAAAAG AACAAGATAT 240 
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TTCTyTATTT 


GTAaTAACGC TGCAAACATA TCCATAACCC CGCTTCTTAO AGCCCCTTTT 360

AATTnATnAA TTAGGGGCTC TTATGCAGTT GGTGCATTAG CAACCAACTG TATTCCTTTG 420

TCCCCTTTTA ATTTATTaAT TAGGGGCTCT TTTGCTOTTQ GTGCATTAGC AACCAACTAC 480

GTTCAATTTA ACCGAATAGT TTAAAATTAA ATACAAACCT TAAATTAGTC TAAAACTACG 540

CCTTTGGTTG TTCAACAAAG CTCGCCATGA GATTTACAAA AGAATCAACT TGTGGCAATT 600

GCAACATGCT CGGATCATAA CTCATAAATG TCGAACGAAT CAGCGGTTCA TTATCAATTT 660

CTACTTTTTC AAACTCAAAT TGTTCTTTGC TGATATTTTT CATCATAATT TCTGGCAAGA 720

TTGTAACACC TACACCACTA ATCAACATTT CTTTGCAAGT TGCTACTTGA TCCACTGTAA 780

TAGTTGCATG GTAATCTTGT TCTAAATTAT CGTTATACCA TTCTTTTATT TGATTTATAT 840

AAATCGGATC AGCTTGAAAC TCTATAAATG GTAACTTTGT AACATCATCT CGTCTATTTT 900

TTGGAAAAAT AAAATAATGA TCATCATTAA ATAAATGTGT GTTAGCTAAA TTCATTACCT 960

TATTTCCACG AGTTATCATA ACATGATAAT CTCTATGATT TGCTTTAATT TGTTCAGTTG 1020

AACCAACTTG CACTTGTATT TCAACATTAG GAAATTGGGC ATTATATAGG CTCAAAACTT 1080

CAGGAAGTAA GGTTTGTCCA ATCAAAGAAG AACACCCGAT TGATATTGTT CCATTCACTT 1140

CACCAATATG TGCCTGCATT TTGTCAAAAA ATAATCGCTC TCTTTTCAAC ATGTCACGAG 1200

CATGCTCAAT AATCATTGTT CCTTCAGTTG TTGTAATCAA TTGTTTTTTT GTTCTGATAA 1260

AAATATCTAC TCCAAAAGCA TTTTCAATAG CTTTTAGTCT TTGTGTAACA GCAGGTTGAG 1320

ATATATATAA AATTTCAGCC GCTTTACGTA ACGTTTTCGT TTCGTCTAAT GTTATTAGTA 1380

AACGATAGTC TTCAATCTTC ATAATTTCCC CCCATAAATT ATTCAATTAT TGAACTTTCA 1440

TGGCTACAAG CATTCATGAG TTCATTACTA ACGAATAATT TCACCAATTT TATTGGTATG 1500

GCTGCAGCTT GAATTACTTA GTTTTTCTTT TGTTGTTGGT GATTTTTAGT TTGATTATAT 1560

TGCTTAGGCT TTATTTGTTT GCTTTTTTCA ATATTAGTTT TATTTTGTGG CTTTTGATGA 1620

TTTTTTTGAG CCTTTGCATT AATTTTATTA AAGCAGTACA TGATTTTCTT TTGGAATCCT 1680

TTAAAATCAT TTTCTAACTC TGCCATAATT TGATGTGCAA TCATATATGC TTCATGAAAT 1740

TGCTTTTTTG TAATTTGCTC ACTTTCTAAT GCAAACATTA AATCATCTTC ATCTACCAAC 1800

TCATaTcACC ACTTGG 1816



(2) INFORMATION FOR SEQ ID NO: 352:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9956 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352; 



GCGTTGTCGT CGATGATTAA TAAAGTATGG GTATACCATT AAGAATAACG CTACCCAAAT 60

g AJcTG ctagt gacgtqccgc caatgacatc tgaaaagtaa tgtgcatgaa aataaaggcg 120

ACAAAATAAT ATGCTAAGCC ATAATATTCC CATAACCAAC GCACTCAACA CTTTTGTTAT 180

TGTCTTAGCA GCAAOTGAAA TAATAATGAT CATTAAGGCG AAATATAATA ATGTGCTGGC 240

GTTGGAATGT CCACTCGGAA ATGAAAAGCC TGTATCAACG GCTAAATGAT TATATGGTCT 300

TGGACGTAAT ACAGTATCTT TAATTAATTT GTTCATGATG ACACCTGAAA CCAAATATGT 360

CACAAACCAA ACCGCTAAAT GCCTCTGTTT AATAAACAGT ATGATTGTCA CGATAATGGA 420

AATCAACACG ACACCTTTGA CATCTCCAAT TTCCGCACTA AACGTCATAT AGTAATTAAA 480

CAAATTGTTA ACATACTGAC GTTGTGGCTC ACCGAAATAA TCTGTAAACC ATGTTAATGA 540

TCCCATATCT ATATTTTTAA GCCATTCTTG ATTTGTCACT ACACTGTAAA ACATACCTAT 600

AAATACAATC AGCGCGATTA AAAATAAAGG CACTGTcATT TTCGGTGATG TTAATTrilT 660

ATCTATCATC TTACAATCTC CTCGTATCAT CATTTTCATT TTACAAATGT TATCCATAAT 720

ATCAATGTGC CACAAATTTC ACTTTACCGA CAATATCAAA ATTATAAAGT TCATATTGTT 780

ATGTATATTG CAAATAAAAC ATTGTATAAT TGAAATAACA ATATTTTGCT ATTTTCAATT 840

TAGTACGATT TATATTTATT ATACAGAGGG GGTAAGGCGT ATCAATAGAG TTATTTTTGT 900

CTATATAGCG TTAATCATTA CATTAGTTAG ATTCTTTACC CCTATTCATC CATCATTTTC 960

AAATTTGATT TACTGGATAT TTGTATTATA TTTTATTCCT ATTATACTAT GCGTTATCGG 1020

TTTCAAGGCC GAAAAACTTA TTGCAACAAT GGTCATTATA CCTAATTTTT TAGGAATACT 1080

TTATCGATTA TATGCCTACG TCACACATAT ACTCTTTATG TAAAAGGATA GTGGGCATGT 1140

CTCGCAACAA ATATGCTTGC GTCGACTTGT CACCATTCGC AAATTTTATG ACATACGCCT 1200

TTCACGGGCT ATATTCAGAC CCACGCATTC ATCCACGTAA TAAACACATC ATGTAATAGA 1260

AAAACAGCAC ACCCAAATAT ATGGCGTTGC GCTGTTTAAC CAAGCATACT TCTATAGCTT 1320

TAATAAGCCA GCAGAAGCAT ACCTAACCTT CTTAAATATG CTTTTCCAAA TTATCCTCAA 1380

GTTTGAATAC GATAATACGT TCACCTGTAA CTGTACTTAA ATCACTATGG AAGCTCATCA 1440

CTTTGATACC TGTAATTTTA AAAATGATAT CATTCAAATC TTGCTCACCG GATTCAACTA 1500

ATTCAGAACG TGTTCGTTTA ATATTTAATA ATCCTTCATT CGTACTACAT ACACGATATT 1560

CAGCTGGCGT TAAGATACCT TGTAAACTAA TAATCACCAT ATCTCTTAAA ATGTCTGATT 1620
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TTTCAGCTTC GATTTCACCT TTCGTTCTTT TCATATCACT AACTCCAATA ATATTAAAAT 


1740 




TGATTACTTC ATCTTTGTAT CGTTATCACG ACATCAACTA TTACATTAAG TTTATCATTT 1800

TTAGTATATT TT AAG AAU v- 1 AGAACATTGT AGATATGATG ATATATTAQT TACTTAGCAT 1860

CGCAACATAT CATCGTTAAA TCCAACTTTT AAAACGCCCT TCCTCATTAA 1920

ACGCAGCCAA TGATTAGACA CCTTCCTAGC GAAATGCTCA TTATTCGCGA \J X V» I IVJV» 1980

TACAACATAG TCGGGTGCCT GAATAACGAC AAGTAAACGA ATTGGCGAAT 2040

CCGATCAGCA GCCATAACAG ATTGCCATGA TAAGCCATAC ATCAGATCAC TCGCATTAl-t- 2100

TTGCATGACA CCAACACCTG ACGTGACGGT TTGTGTCGCT TTATTCCCAC TTCCGTAAAA 2160

ATGCGGCGCA ACTGTCGACG CATAATATTG TAAATTAATC CATTGTGCCA CAAGTGCCGG 2220

ACCAGAAATG ATGGTATTTA ATAATGTGCC ATCTTTATCT TTACGCCAAT CATAATTGTG 2280

TAAAAATGTC CGCCCTTCTA AATCAATGCC TTTTGTTAAT TGGCGTCGTC CAATTATAAA 2340

TGATGCATTT TTAGCCAATC CCCATTCTGG ACGTACCTCA CTCCAATCAC TCGCAAACCG 2400

CTGCGCTTCT TCCACTGGAT GATTCACACG ACCAATCGTT GGCAGTTTGT CCAAACGTTC 2460

GCGATTCGcg TGtTCAGAAA TCATCGGCAT CGCGTCATTC AATGATTCAT ATGCATCTAA 2520

AGCAATAGAA GATAATGTGT CTGGCACATA TACCCATGCC AACGTATCAG TAGACGTATG 2580

ATGTTCTGCT ACCGCAAAAA CAGTTGTCTC TGGAATATAC ACACCTGATT GTTTTAATCC 2640

TTGTCTGACA TTTGGACGAT TACATATCAT CGCTAATAAC TTAGCATTAA AACCGCTTGA 2700

TGCGCCACCA CAAGCCCCAC ATTCAAGTGA TGCATGATGT GGATTATTGT GAGAATGACT 2760

AGCATGACCT GCTAACACAA CGAACGGCGC AAATGCTTCG GTTAAATCCA TCAATTTCAA 2820

CGCTTGTAAC GCGAAATCAA TTTGCTCTTG CTCAGTAAAT CCAACAGGTA 2880

TCGGTCAAAC TCACGATCAA TCGTCAACTT TGTTTCAGGC TTTTTCAACC X lllblli 2940

TATTTTTTGT AAAGACGCGC GACTTTTTCT AGGCATAATC GAATTGACAA 3000

GCTTAAAAAT GGCCCACTTA ATTCAGGCAA TAACAGACTA GGCATGACAT it\A A A A Ji\mt\X 3060

CAATTTAAAT GTGTAAAACA TCGATGACAT TGTCTGTTGC TGTTGTCGAT AAACATTCAT 3120

ATCGTAGCGG TCTGCAAATT CTTTAATGCG ATATGCCGGC GGTACCATGA CAGGTAATGA 3180

ATCATGTTTG AATTGTTCGT CTACGGCATC TTTTTGAATA GGTAATCCAA AGAAGCCTGC 3240

AATACCAATC GTTTCAAAGG GCCCTGCTGC TTCGATATGT CTACGAAATG GTTCTGAACG 3300

AACATCTATA CAAAATGCAA TTTGCGCTTT CGTTGATGTG CCCATCTGAT TTAGCTCGCT 3360

ATTATTTTCA TCAACTGCTT GTGTGTCATT TAACAATACT GAATGTGGCT GATTAGCGTT 3420
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TGCTTTAATT TTTTGTTTTA ACTGAQATTC GTATGTCATT TCCCAGGCAA TTAGCCATAA 3540 

ATTTTTAAAT ACATTTTTAT TCATAGTTGC TGCAAAATGA ATAAACGTTT GAATTTCATT 3600 

GACGTCATGT TGTAGTAATA CATCGCTAGG CATATCACTG TAGTAACACC ATGATGCAAC 3660 

AGTTTGCTTA AACCAATTTT CCGATCTACT TTCACAATCT TTAGCGACTG ACTTAAACTC 3720 

ATCACCAACT AGCAATTGTT CGACAACTAA CCGAATTGCC AAATAATCCG TTAACAAATG 3780

TTGTTCAAAG TGATGCTGTT GTGAACGGTA ATACAACATA CCTGCCCAAC CCGGTAACGC 3840 

CAAAAGATGT CCTTCAACAT AAGCTTGGTA GTCTTCCTGA TCTATTGAAA AATGAGTTAA 3900 

TACTGACTCT ATCGTCATTT CAGGATCATT GGGTAAGCCT TTAATCACTT GGCGCTGTGC 3960 

TTTAGTAAAA CTATGGTCAT GTTGCGCTAA ATGCAACCAT GCATGGTAAA AACTTTGcTC 4020 

ACGCTTCGGC ATTGTCCAAC TCGATAGAAA TTGATCGATA TAAAGTTTCG TCCATTTAAT 4080 

CATTTGACGA TTCACTTGTT CGCTAAGTGG CTCACCTTGT TCATCTATTA TTGCATCACT 4140

CATCGGACGT ACATCATAGT GATGATATGA TTCAGCCATA TCAOGTTTTG ATTTTTCTAA 4200 

TAGTAGATCA GCAACAACAT CAACATTTGA ATGATTCATA TATGATGCAG GTACGTCTTT 4260 

TAATGTTTTA ATGTTATCAA TATAAAGATT GATGTAGTGT TGCGGGATAT TGTAGTGATG 4320 

TTCAAGTAAC ATATCAGTAA CAAGTTGATT AAAGACACTT TCATCTAATT CACCACGTGC 4380 

CACAGCGCTT TCTATTAATG CTTTATTTGG GAAAATATCC ACATCTCGAA CATCACGTAA 4440 

CCATTTTGCG ACATCTTCAA ACGTATCCGC TTCTAATCCT TCCCATGGAT TTCGTGCTGC 4500

AAAAATCGAA ATTGGTGATA ATGGTGTAAT AACACGTTTC GCATTTTCAA TGACTGAATT 4560

GATATTTAAC TGTGTTGTCA TACCTTTCAC CTCCTATAAA TACTTCTTCA AATAATTCGG 4620

ATGACTTTCT ATCGCTTTCG AGCGTGCTTC ACCTAGATTA ACTAACCACA CGTACAATAC 4680

CGCAAAAGCC TTAGAGTATC GATGCCGCGC CACCCAAATA CTTAATAAAC TGCCAAAGAT 4740 

TAAAATAACA ACACTAATGA TGACACTCAC TGTAGGCGGC GTTGTCGCAT GTGTTGTTAT 4800 

ATTTTGTAAT ACAGCGTAAA AATAATTATG TGTGATGACG TAGATAAATG TCACGATTGC 4860 

AATCAAAATC ATACCAACAA GACGTGCCAT GCGTCCTTTA CTAAAGGCTA CCATTTGATT 4920

CCAAGATACA AGTAATGACC ATCCTAGAAT GAGTGCACTT AACACTTCAT ATGCACTTCT 4980

GTCACTACTC ATCCAAAATA GAAATGCCAC GATAATAGCT AATACACGTC CCATGACAAT 5040

CCAGCCATAA GCGTCTTTAG CAGATGCTTG TTTTGGAATA TTGAATCGCT TCACGATAGA 5100

ACCTGATTGT AAAAATAATG TTGCTTTAAA AATACCGTGC AATATTAAAT GAATAATCGC 5160 

TGCTGAATAT ACACCCAATG CACATTGAAC TAACATAAAG CCCATTTGAC TCATCGTAGA 5220 






AGAAATACTA GAAAGGATAA GTAATAATGA 


TAACGCAAAT CCATTATCAA ATATCGGCGC 


5340 




AAAACGAGTT AGAATAACAC 


CACCTGCATT 




TTG CCGATAC 

A A UUvWiAV. 


5400 


5 


TGQCQTTGGT G COG 11 ACAO 


ATTCAATCAA 




CSTGCTGCCGG 


5460 




TATCATGACA GCTAATACAA 


GTAGTACATT 


CGTGAACAAT viAl-vZA 1\j a V-Vj 


un X \3/V\V» A n A 


5520 


10 


ATGTTGTGGT ACCCGCCACT 


CGCCAGTCGC 


AATATAAATA Gl 1ACAA1 i\J 




5580 


TGCAAGCCAA CCACATAAAA ATGTCATGCT TGATAATTTC GCAGACTCAC GTGGCACTTT 5640

CCAAAAACGA TTAACGTTCA TCAGCAATGT TAAACATAAT AATGTAATAC CCCAGCAGAG 5700

TGCCATCAGT CTTAAGTCTT CAGACATCCA TGCTAAAGAT GCAAACGACG TAATCGCAGT 5760

GAACAATGGA AAGTAATGTC TATAATGATG ATCACCTAGT AAATATCGCA TTGAAAACTT 5820

TTGAATAATA AAGCCAAGCG CCATTACAAA GCCAGCTAAT AACCAAGATA AACGATCTAT 5880

TTTAAATGGA CCTAAGACAT GTTGACCATG AATACCGAAA AAGCCAATGA CTGCAAATAA 5940

TACTGGCATG ACTAGTATGT ATAAATGTAA TTTAATATAT CTCATTGGCA TAACTGGTGC 6000

TAAAAACAAC AAGCCACTTA TCAATGCAAT GATAAGCGCA ATAACAAACA GTGAAAATAG 6060

CAATTGAAAA CTTAACACTG CATAACCTCC TTATTTCTAA TCTCTCGCAT AATTGCTTAT 6120

GTATAAAAAT AAAAACCTAC AATAGTAGAT TCTGTACATA ATGGCAGAAA ATTTACTATT 6180

GCAGGTTTCA GTTTAACTAG ACACTGCATC ACGGTACGTT GATATACCTT GTTGCAGTGT 6240

TCTCTTTAAG CGTGCTCCCA TGCACATATG TATATAAAAT GTTACTTCTG TCTGTTCAAT 6300

TCATCTTCAT AAATATGCTT TGCCTAGACG AGACCTAACG TGTTATTCGT TTTAAACTTA 6360

TAACATAAAA TATAATTAAA TTTCTGCTTC ATGTCAAATT CATGAGCTTA ACCTCTATTA 6420




AACCAATGAT TGTAAAGATT 


TTGTAAATGC 


ACCTGTACAG TTAGGCAGTA 


a 1 1 CCCGTCC 


6480 




TTTTAAAATA AAAAATTCGC 


AGTTATGATC 


AT AACAATT C AAGTTAGGAA 


Tv TV 7\ TV ATVTy rvT* 
AAAAAlvAAl 


6540 


40 


TACGCACAAG ATAACTATGT 


ACAATGAAGT 


TAACTCATAA GCAAAGVsAvjO 


*T* & 7\ T^r' w i w i 1 TV TV T 


6600 




GGGTATCATC GCTGGCATCA 


TTAAAGTTAT 




X »— r\^- A VJVJ Inn 


6660 


ATAAGATTTC ATAACAAACA AAGGAGGTCT TTCACATGGG TATCATTGCA GGAATCATTA 6720

AATTCATTAA AGGATTAATT GAGAAATTCA CTGGTAAGTA AGTTATAAAA ATCTCATAGA 6780

TATGAACATC TTATTTGAAG GGGGCCATTC ACATGGAATT CGTAGCAAAA TTATTCAAAT 6840

TCTTTAAAGA TTTACTTGGT AAATTTTTAG GTAACAACTA ATCTCAAACA TTAACGATCA 6900

ACAACTCATC ACTATGTTAA ATCAACATAC AGGAGGACAA AACGATGGCT ATTGTAGGTA 6960

CTATCATTAA AATCATCAAA GCAATTATCG ACATTTTCGC AAAATAATTT AAGCGAATTG 7020
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TTATTGATGT GAGGTGAGTC TTGTTAGTTT GTTGCAAATA AATOGTCTTG OTGTTTTTTG 7X40 

TATAGGACGT TCTTAGTGGG ACATACGOAA TATTCOTGAT CTTTGTAGTC TGACGOGTTA 7200 

TATTTTTGTG GCGTGTTTTA TGTTTGATAC TCGAGTTCTG AGACATTCAT GATTTGGCAT 7260 

GCGAAATCTT AATGATTTTC ATGATCTAGC GCAAGATATA TTGGCCACGT GCGGAATTGC 7320 

GTTGCACGTT TAGACTGAAA CACTCGTGTG ACCGTAAGTG TTAATAGTAC ATTGATAGCT 7380 

GCATTTACTT CACTCATTTT TATGACTGTT AAACAATGAT TGTACCTTCA ATTAACAGTT 7440

GGTACGATGG TTTTGCCATT TTTCATCAAC GTAAATATAA AAAGGACTAA GACACATACA 7500

TGTCCTAGCC CTATGGATAA AATGCAAATT TCTGCTTTAT CAAAACTATC ACACTTTAGA 7560

TAGATTGAAA ACAAAAAGAT CCTAAGAACA CCTTAACTTT TTATTAATTG TCATAAATTG 7620 

CAAACAATTA AGCCACAATT CAAAAATGAT TATACTTCAT TCAACTTATC GTGCTGGTCT 7680 

AATTTGCCAT TGATATGGAT CTTCAAATTG TTGCCAATCT GCATCAATTT CTTGOGCATT 7740

GACTAAGCAT GCGTCGAGTT CTTTTGTTAA TTTTTCTTCA TCTAATTCTG TACCAATAAT 7800

GACAAATTGT GTATGACGAT CGCCATATTC TGGATCCCAT TCAGCTGCGA CATCTTGACG 7860 

TTCTGCTAAT ATTTGTGTTT GTTGCGCTTC AGACATACTA GCCACCCAAT ATGTAACTGG 7920 

ATGAATATTG CAAGATGACC CTGCTTGAGA TAATAAACAT GCTACGTGAT TGTATTGTGC 7980 

TAGCCATACG ATACCTTTTG ATCGAACGAC ATTATTTGGG ATGCTTTCTA ACCAATCATT 8040

GAACCTTTTA GCATGGAAAG GTAGACGACG TTTATATACA AACGATGATA TACCATATTC 8100 

TTCTGTTTCA GGTGTATGCG ATGCATGCCC ACCAGACTCA AGTTCTTTGA TCCATCCTGC 8160 

TGACTCGCTC GCTTTTTCAA AATCAAAACG CTGCGTATTC AAGACTTCTT TTAAATCTAC 8220

TTCAGAATTT GTTGTCTTAA TAATTTTAGC AGTCGGTTGC AATGCGCTTA ACATTTTTTC 8280 

TAACTTCGCT AGTTCTTCTT CACTAATTAA ATCAATTTTA TTAATAATCA ATACATCACA 8340 

AAATTCAACT TGGTCAATTA ATAAATCAGC AATCGAACGC TCATCTGTTT CGTCAACGCT 8400 

TTGATCACGA TCCATCAATA AATCTTCTGA GTTGATGTCA TGTACGAAGC GGTTAGCATC 8460 

CACAACTGTA ACCATTGTAT CTAAACGGCA AATCGCTGTA AGATCAATGC CAAGTTCATC 8520 

ATCAATATAT GAGAAAGTTT GTGCAACAGG TACTGGCTCT GAAATCCCTG TTGACTCAAT 8580 

AACAATTTGA TCGATGCCAC CTTTTTTCAC TAAACGCTCA ACTTCTTTTA ATAAATCGTC 8640 

TCTAAGTGTA CAACAGATAC AACCATTAGA AAGTTCGACT AATTTTTCAT CTGTACGCGA 8700

TAGTCCCCCA CCATCTGCGA CAAGATCTTT ATCGATATTT ACTTCACTCA TATCATTTAC 8760 

AATTACCGCG ATACGTCGAC CTTCTCGATT TTGTAAAATA TGATTTAACA ACGTTGTCTT 8820 
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ACTTCAATTT ATTTGTAAAT AGGAATAATT CTGTTTTACA TTATATAGGA GCGTTTCCTC 8940 

TTTCGCAATC TTCGATAATA AAAAAATAGT ATACTTAATT AAATTATTGA GCGCTTTACT 9000 

TTATAATGGA GACAAAGATA TATCTCACGA AAGAGAATCG AGGTGTATAA ACATGTTATT 9060 

TGTCATTTTA GTTTTATATG TTACTGGTAT TGCATTTATT CTACTCAGTG TTTTTGGTTC 9120 

AAAGACTGAA GGATTATCTA CGAAACATAC TTTATATACC ATTGGCAGTG CTATTATAAC 9180 

GATTGCTATT TTCATTTCAA TTGGCTATGC CATTCAATAC TTAACTGCAG CGCTTTATGG 9240 

TTTGTAAGGT GAAGGTGATG AGTAACGGGT AGTTCGGGAG AGGTTAACTT GCGTTGATTT 9300 

TGATAAAGTG ATCATAGCTT TTAGTACTTG AGGATTTTTA TTGTTGCTGT TACGAATGTG 9360

GTCATGTTTA ATGCGGGACA GTAATTTAAG TTGTTTTTTT ACAATTGAGA GTGTGATATT 9420 

TCGATTCGGT TCGAATTACT TTACATGGGA ATAATATAAA TTAAAAAGAA GCGGCCTAGT 9480 

GTCAGTTGTG AATATACTGA ACATTGGTCG CTTTATTTAG TAGTATGATA TGTAGTTTAG 9540

CTATTAATTT TTTTCAGGTC ATCCTTAATG CTGTCTATCT CAGACATGGC ACTTTTAACC 9600 

CAATCTCCTT GAGCTGCACC TTTAAAATTA GCTTTAAAAg ctTCGCAATG TTGCGCCATT 9660 

TGTTCAATTA ATACTTTTTC TTCACCTTTT AATCCGTTTT CAATATCTTT GTATTTATGC 9720 

TTATGTTCAG GTGCAATAAC TGTGCGAATA TTTTCTTTTT GCGCTTCCAT TTTAGATATG 9780 

AGATTAAGTG TTTCTACTGT AGTACTTATA TCTGGCATTC TTAAGGTCAT ATCTGGTTCT 9840

ATTAGAGTCA TTTAATCTCC TCCAAATTAT CAGTCACTTA GCTTATCTAA CTGCTTTTCA 9900 

TAAGACTTTT TTAAGTCTTC TTTATATTCT TCTAATTTCC CATTCTTGCT TTCTGA 9956 
(2) INFORMATION FOR SEQ ID NO: 353:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2411 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 

TTTCTTTTAA CAGaTTTATC cCATTAATTG TTCTACAAGC CCACGaTGAG CAATATCATT 60 

TTTAGCAACC ATTAATAAAC CAGAAGTATC CATATCTATA CGGTGAACAA TACCTGGACG 120 

AATTTCTCCA TTAATACCTG ACAAATTTTT AATTTGATAC ATTAAACCAT TAACTAATGT 180 

ATTGGTATAA TGCCCTGGTG ATGGATGAAC TACCATGCCT TTCGGTTTAT ATACAACTGC 240

AACATCGTCA TCTTCATAAT AAATATCTAA ATTTAAATTT TCAGGTAGAA TATCAGCTTC 300
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AACAACTTTA 


TCGTTTGCAA CGACTAAACC TGCTTTAATC CAATCTTGTA TCTGGTTACG 420

AGACCAATCA TTATTTAATT CAGGCAGCAA CTTATCTACA CGCATACCTG TTTGTTCTTT 480

ATCTGTAATG TTAAATTCAT AAGTCTCCAT TACTTAACCT CCTTCTCCTT TTTATTGGAA 540

GTATCCTTTA ATAAGGCAAT AATAATTAAT ATTACACCAA TTGTTAAACT TGAATCTGCG 600

ATATTAAATA TTGGAAAATC ATAACCAAAA ATATTTGTAT CAATAAAGTC AACAACTTCT 660

CCTPGTTAAAA TTCTATCAAT AAAG1"ItCCA AGTGCACCTG CAAAAAGTAA ACTAATAGCA 720

ACTTGCATAA ACAAATTATA TTGAGCATCT TTAATAAAGA AATATACTAA GGCTATTAAT 780


1S 


ATAATAATGG 


TAATAATAAA 


GAAAAATGTC 


n x x x x x 


TCAATATTC C 


CCATGcAGCA 

V» wl X wWYvvil 


84 0 




f*f AVTATTTf* 

^bVpwvw • * » * a X 


GA.TGTGa.1X5T 

^**»X>* X X X 


TATGTTTAAA 


AAGTG CGGTA 


TCACTTCAAA 


TGAATCTCGA 


900 




ATTTTCATTfi 


TAVfyPAfV^TAT 
ifw x nuv x*%x 


HAT AT ATTT A 
/V\ 1/% x^v X X X f\ 




PAAATATAAP 

K^nnn, x A X nnv 


GAPAAATAPT 
wiLnnn iAv X 




SO 


GOTATT&AAA 




A ATA A A AT AT 
AnliinHniAl 


X X X X XvJ X Vjrv— M. 


X X X X l«\J X X 


T»ppa »TPa at 

X *v\-AA X InHn X 


X U« v 






fW^lfW* X \— XXX 


ATATTATAftr 
nXnX lnlnu\r 


TTAHAPrTYir' 
X X/\V^\^*VX\9\v 


TAATAAAAAA 


AGTAAGPATA 


ionn 

1UOU 


25 


*P*Pft fft^PT^ A A 


TPTA ATrtTTA 
X V- XrVrtX \9 X X n 


V— X i\r\\~ X \—r\*\ X 


AOTTV2ATAAA 
Avl X Van X Ann 


V- X AL XAXVJ X X 


TTflAPATTAA 
X XuAUil Xaa 


X X 1 * u 


AlniunAL X X 




^P^P & T*^^ A^P A^^P^P 


X nnunl V*nl_n 


^P^P & & A & & i 1 ^ 
1 A nnAMA X 1a 


ITS A A Ar" , /~' A/^/"» 
uuaaauUaw 


XzUU 




r**iY2T2a a r* a t a 


iin X t—v~V» X Ann 




Innunlnl X X 


X ^- Innl IaHA 


Ani IaILI XA 




30 


f m \ TS/'Tfyi* POT 1 
V* luU X\9 


>— XA X X inlnk. 


MnlnV, X X X 


ATTftAATfiflP 


X X *~v?\- In 


a Tr^rnrar 1 a 


i ion 




^AT 1 *!* A ^"FVIVI'A 




nnlAUnnl X X 


X ivinlVlnlurVj 


nTPP PTT f^f^T* 
VJ X V^V^V, X X x 


AVJVMIULvU X 


i Tan 






V3 N— X X \»-VJ./A.Vi_ X 


GGPACTGrTr* 
x va x 


\~V^ X wnVJxMvO X 


PTPRPP-ATTA 


A TAPTA HCZT A 


i a a n 


3S 


*I W I'JX A f ATG TA 


ATTTTAPTTT 

f\ X X X X XXX 


T A AATAPTTT 


AAAAAAATAA 


GAGATGAATC 


ftTPTAPAPTT 

w X ■ 1 AV_n^. X X 


IjUU 




AATTGGACAA ATTCTATGAG AATAGATATT GTTAATTTAA GAAAGTAGGC TATTTTGAGT 1560

Tt, CACTCGAA. TGTCAGTTCG AGGAATAAAT AAAGTTAAAC GAGAGCTAGG TTTTGTATTA 1620

ATGGCAATTA ATATAAGGAA AATAGCAGCT CAACGAGCTG TACATTATAA AATACATATC 1680

AAAAAAGCTG ATTTCTATCA AATAATTAAT AGAAATCAGC 'rrrrrrACAT TGCCTAAGAA 1740

CTTAATGTCC CAAGCCCTAA AACTTGTTGT TATTTATTTG ATTTAGCAGC GATACGTTTA 1800

TATCTTAAGT ACATAAATGC TAAAAGTATA AACCAAATCG GAATAAAATA AATTGCACGT 1860

CTTGTATCAA CATTAATAAA TAATAACCCG AACACAAAAA TGAAGAATAC AAATATTAAG 1920

TAGCCCATAT ATTTGCCACC TAATAGTTTG TACGTAGCAT TTTTATGTAG ATCTGGGTTT 1980

TTACGACTAT AATTGATATA TGCAATGATA ATCAGACCCC ATACAACTAA AAATAACACT 2040

GTAGAGATGG TAGTCACATA CGTAAATACT TTTGTCGCAT CTGGGAAAAT ATAGTTTAGT 2100
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TTATTCGTCT TAGAAAAOTT COGAGOTOCT TGTyCTTGAC TTOATAAACC OaAAAG CAT A 



2220 



CGGCTATTTG AGAATATACC ACTGTTACAT GATOAAGCAG CAGCGGTTAA TACTACAAAA 2280

TTAATCAAGC CCGCAGCAAA CGGAATTCCG ATCAATGCGA ATrATTTTnC GAATGGACTG 2340

TTATCAGGAT CAACTTGCTG CCAAGGGGTA ATAGACATGA TAACCGCTAA CGCCCCAACG 2400

TTnnATATTA A 2411

(2) INFORMATION FOR SEQ ID NO: 354: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354:

GnGAATTATT TTTAATAATG AAAGGATTAC TTnCATGGGT TTTTACTAGG AnTACCCAGA 60 

AGGTCAAAAT ATTTTTGATG CGCTAAGTCA ATATGAAGTT AAGCGACOCG GCOATATGGA 120 

AGAGGATCCA TCATATAAAC AACTCATTTC TTATTGTTTA CTTGAAAATG AGCATGGCGA 180 

GATATTAGTG TATGAACGAT TATCTGGCGG TGGAGAAGCT CGATTGCATG GACAATCTTC 240 

AATAGGTGTA GGCGGTCATA TGAATGATGT TCCAGGAGCA GAATCTATTA ACGAAGTATT 300 

GAGAGTTAAT GCACAGAGAG AATTAGAAGA AGAAGTAGGT TTAAGTGAGC AAGATTCACA 360 

AAATATGGAA TATATCGGTT TTATTAATGA CGATAATAAT GAAGTGGGCA AGGTACATAT 420 

TGGTGTTGTA TTTAAAATCA CTGTAAGTAC GAATGATGTA GAAGCTAAAG AAACAGATAC 480

TTTACGAATA AAATGGGTTG AAAAAGGCAA CATAGAGTCA TATGATGATT TCGAAACGTG 540

GAGTGCATTA ATCCTTCAAG ATTTATAATC AAACGAGGTG ACATATATGT CAGATATTAT 600

TCCAG 605 
(2) INFORMATION FOR SEQ ID NO: 355:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 668 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355:
TTTATTAGCA CATCCAAACT ATTCATATGT TGGACAATTT TTAAACGAAC TAGGATTTAA 60 
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ACAATTAGAC ACTGAACATT TAGCTGATTT AAATCCAGAQ CGTATGATCA TTATGACAGA 180 

TCATGCTAAA AAAOATTCTG CTGAATTCAA GAAGTTACAA GAAGATGCAA CATGGAAAAA 240 

GTTGAATGCA GTTAAAAATA ATCGCGTGGA TATTGTTGAC CGTGATGTTT GGGCAAGATC 300 

TCGTGGCTTA ATTTCTTCTG AAGAAATGGC TAAAGAACTT GTTGAATTAT CAAAAAAAGA 360 

ACAAAAGTAA GGTGGAAGTA AATGGCTATA AAAGAAATAA GTAGCCAATC TGCCATAGAT 420 

CATAAAAGAA AAAGACGCAC AACACTCACG TATATAGTGA GTTTGTGCTT TCTTTTTATT 480 

TGTATATATT TAAATATGGC GATTGGTTCT TCGAAAATTA ATTTTAGCGA TATCATTCAC 540 

TATGTTACTG GTCATACAGA TACGAAAGCA AOGTTTTTAT TGCATAATGT ACGTATGCCA 600 

AGGATGATTG CAGGGTTATT TATTGGCGGT GCATTAGCGG TATCTGGTTT GTTAATGCAA 660 

GCAATGAC 668 
(2) INFORMATION FOR SEQ ID NO: 356:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356: 

ATACAAAAAA ACATATCGAA AATAAAGCTA AAAGAAACTA TCAAGTTCCA TATTCAATTA 60 

ATTTAAATGG TACATCTACA AACATTTTAT CGAATCTTTC ATTTTCAAAT AAACCTTGGA 120 

CAAATTACAA AAATTTAACT AGTCAAATAA AATCAGTACT GAAGCATGAT AGAGGTATTA 180 

GTGAACAAGA TTTAAAATAT GCTAAGAAAG CTTATTATAC TGTTTATTTT AAAAATGGTG 240 

GTAAAAGAAT CTTACAGTTG AATTCAAAAA ATTACACAGC AAACTTAGTT CATGCGAAAG 300 

ATGTTAAGAG AATTGAAATT ACTGTTAAAA CAGGAACTAA AGCGAAAGCA GACAGATATG 360

TACCATACAC AATTGCAGTA AATGGCACAT CAACACCAAT TTTATCAAAA CTTAAAATTT 420 

CGAATAAACA ATTAATTAGT TACAAATATT TAAATGACAA AGTGAAATCT GTATTAAAAA 480 

GTGAAAGAGG CATCAGTGAT CTTGACTTAA AATTTGCGAA ACAAGCAAAA TATACAGTAT 540 

ATTTCAAAAA TGGAAAGAAA CAAGTAGTGA ATTTAAAATC AGACATCTTT ACACCTAATT 600 

TATTTAGTGC CAAAGATATT AAAAAGATTG ATATTGATGT AAAACAATAC ACTAAATCAA 660 

AAAAAAATAA ATAAATCTAA TAATGTGAAA TTCCCAGTAA CAATAAATAA ATTTGAAAAC 720

ATAGTTTCAA ATGAATTTGT GTTCTATAAT GCAAGCAAAA TTACmATTAA TGaTTTAAGT 780

EP0 786 519 A2 

(2) INFORMATION FOR SEQ ID NO: 357: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 534 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357: 
AAAGTAAAAA TAAATCTCCC TTTTTAACTT TCGTTTCTGC CATAGCCATT GCTTCTTCTG 60 
TGATAGTTGC TACAATATCT TTTCTTTCAC GGTTAAAATG TTCAACTTGT TCTGCTAAAA 120 

ATGCAGCTTC TTCTTCGACG TCAGTCATCA ACAATTCGCa AGCTAATGAT GCGTCATCTA 180 
AACGACCTAC AGCATTAAGT CTAGGTCCAA TAATAAAACC AATTGTTTCT TCATCAATAT 240 

TGTCATTGTA TCCCGCTTCT TTTAGCAATG CTTTAACAGA GGTCGGACAT TGATCATTTA 300

AGACTTTTAA TCCTTGTtTC ACTAATGATC GATTTTCATC AGTTAAGGAT ACTAAATCCG 360 
CAATGGTACC TATCGCAACT AATGCTTTAA AATAATCAGG TACATTTtCA ATCAATGCTT 420 

GTGCTAATTT GTATGCAACA CCTGCACCAC ACAATTGTTG GAACGGATAA TTAAACGATG 480

GATGCATTGG ATGTACGATT GCATATGCTT CTGGTAATGT ACTACCAATT TCAT 534 
(2) INFORMATION FOR SEQ ID NO: 358: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3621 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358:

GGTGAGTCAA ATTAAATGAA TCTAATAAGT CATAACTATC TATTTGTAAT GTGCAACGCT 60

TAACGCATAT ACAAAATGAA TGTGCTGATA ATGATTTACT CAAATTAAAA GGTGATTTTT 120 

ATTCAATGAT GAATGAAAGT TGCCTTTTTA TTTTTGGTAA AAGTTAATGC GTCAGTOAAT 180 

TGTGTAAGTT TTTCAAAAAG TAAAAAGAAA TAATAAAGGT GAATTATTAG AATTCCArAA 240

ATAATTCATT ACATTCATAA AGCATTTTAC AAATGGTAAG AAAATGAGTG TTACAAATCT 300 

AAATATTGCA AAAGAAGCTG ATTTAGTCAC AAAAAATGTC CTATGTAATA ATTCGAGAAA 360 

GATGCACTAT ATACGGTCTT CTTACTATTC AAATGTAAAA GTTGCTTATT TGCGTGGCTT 420

TTTGTTTTAT AAAAGTATAA AATTTTACTA TAATATATCT TGTAGAGAAC AATGAAATGA 480
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10 



ATAGCAAACT GTATTACTTT GATACAAAAA TGGTTGTAAT AAATATTTAT CGATATGACG 600 

ACTTGAATAT GATAAAGTGA CATATTTATG TATATGACTA TTTCGCAAAA TGTAATCGAG 660 

GTAGAATTTC TTGACAATTC TGTCAGTTTA TAAGATGTTA TAAATATGTA GTGTATAAGG 720 

AGGCAAACAA GATGACTGAA GAATTCAATG AATCAATGAT TAACGATATT AAAGAAGGTG 780

ACAAAGTCAC TGGCGAGGTA CAACAAGTTG AAGACAAGCA AGTTGTTGTT CATATCAACG 840

GTGGTAAATT TAATGGGATT ATTCCTATTA GTCAACTATC TACGCATCAT ATTGATAGCC 900 

CAAGTGAAGT TGTAAAAGAG GGCGACGAAG TTGAAGCATA TGTCACTAAA GTTGAGTTTG 960 

ATGAAGAAAA TGAAACTGGA GCTTACATCT TATCTAGAAG ACAACTTGAA ACTGAGAAGT 1020

CTTATAGTTA TTTACAAGAA AAATTAGATA ATAATQAAAT CATCGAAGCG AAAGTAACAG 1080 

AAGTAGTTAA AGGTGGTTTG GTTGTTGATG TAGGACAAAG AGGTTTTGTT CCGGCTTCAC 1140 

TAATTTCAAC AGACTTCATT GAGGATTTCT CTGTGTTTGA TGGACAAACA ATTCGTATTA 1200

AAGTTGAAGA ATTGGATCCT GAAAATAATA GAGTCATTTT AAGCCGTAAA GCAGTTGAAC 1260 

AAGAAGAAAA OGATGCTAAA AAAGATCAAT TATTACAATC TTTAAATGAA GGCOATGTTA 1320 

TTGATGGTAA AGTAGCGCGT TTAACTCAAT TTGGTGCATT TATAGACATT GGCGGTGTTG 1380

ATGGTTTAGT GCATGTATCT GAACTTTCTC ACGAACATGT TCAAACACCA GAAGAAGTAG 1440

TTTCAATTGG TCAAGATGTT AAAGTTAAAA TTAAATCTAT TGATAGAGAT ACAGAACGTA 1500 

TTTCATTATC AATCAAAGAT ACGTTACCAA CACCTTTCGA AAATATTAAA GGTCAATTCC 1560

ACGAAAATGA TGTCATTGAA GGTGTCGTAG TAAGATTGGC AAACTTTGGT GCATTTGTTG 1620

AAATTGCACC AGGTGTACAA GGACTTGTAC ATATTTCTGA AATTG CACAC AAACACATTG 1680

GTACGCCAGG TGAAGTGTTA GAACCTGGTC AACAAGTAAA TGTTAAAATA TTAGGTATTG 1740 

ATGAAGAGAA TGAAAGAGTA TCACTATCTA TTAAAGCAAC ATTACCAAAC GAAGATGTTG 1800

TTGAAAGTGA TCCTTCTACG ACTAAGGCGT ACTTAGAAAA CGAAGAAGAA GATAATCCAA 1860

CAATTGGCGA TATGATTGGT GATAAACTTA AAAATCTTAA ACTATAATTT AATATTTAAT 1920 

AGTCAACTCC ACATGTTTAT GATTGcATGT GGAGTATTTT TATGTAACAA AATATACTCG 1980 

GAATGATAAC GTGGgACAAA TTTAACTAAG TGTTTAAAAA GATArAGTTT TAAGTGCtGa 2040 

tTTTTATCAT TACAGTAATA AACTCATTTT GAATACACAG TCTCATGTGA TATTATTAAA 2100 

AAGATATaAG AAAGAGAGGA AGTTAGCTTA TGACTAAACC TATAGTAGCT ATTGTAGGTA 2160

GGCCTAATGT AGGTAAATCT ACAATTTTTA ATAGAATAGT TGGAGAACGT GTTTCGATTG 2220 

TGGAAGACAC GCCAGGTGTA ACACGAGATC GTATTTATTC TTCAGGTGAA TGGTTAACAC 2280 

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10 



AAATTAGAGC GCAGGCAGAA ATCGCCATAG ATGAAGCGGA TGTTATTATT TTTATGGTTA 2400

ACGTGCGTGA AGGATTGACA CAAAGCGATG AAATGGTCGC TCAAATTTTA TACAAATCTA 2460

AAAAACCGGT CGTATTAGCG GTTAACAAAG TAGATAATAT GGAAATGCGT ACAGACGTGT 2520

ATGATTTCTA TTCATTAGGA TTTGGTGAAC CGTATCCGAT ATCAGGGTCA CATGGTTTAG 2580

GTCTTGGTGA CTTGTTAGAT GCAGTTGTTT CTCATTTTGG TGAAGAGGAA GAAGATCCTT 2640

ATGATGAAGA TACAATTCGA CTATCCATTA TTGGACGACC AAACGTAGGT AAATCAAGTT 2700

TAGTAAATGC TATTTTAGGT GAAGATCGCG TTATCGTTTC TAATGTTGCA GGGACAACGA 2760

GAGACGCTAT TGATACAGAG TATAGTTATG ATGGACAAGA TTATGTTTTA ATCGATACTG 2820

CTGGTATGCG TAAAAAAGGA AAAGTATATG AATCAACTGA GAAATATTCA GTATTAAGAG 2880

CTTTAAAAGC GATTGAACGT TCAAATGTTG TTTTAGTGGT CATAGATGCA GAACAAGGCA 2940

TCATTGAACA AGATAAACGT GTTGCAGGAT ATGCACATGA ACAAGGTAAA GCAGTCGTGA 3000

TTGTCGTAAA TAAATGGGAT ACTGTGGAAA AAGATAGTAA AACGATGAAG AAATTTGAAG 3060

ATGAAGTACG TAAAGAATTC CaATTTTTAG ATTATGCACA AATTGCTTTT GTGTCTGCTA 3120

AAGAACGCAC AAGATTACGT ACATTATTCC CTTACATCAA TGAAGCAAGT GAAAACCATA 3180

AAAAACGTGT TCAAAGTTCA ACTTTAAATG AAGTTGTTAC TGATGCAATT TCCATGAACC 3240

CTACACCAAC AGACAAAGGT AGACGTTTGA ATGTCTTTTA TGCAACACAA GTTGCTATAG 3300

AACCACCGAC ATTTGTTGTA TTTGTTAATG ATGTAGAATT AATGcATTTT TCTTATAAAC 3360

GCTATTTAGA GAATCAAATC CGTGCCGCTT TTGGTTTTGA AGkTACACCA ATTCATATTA 3420

TAGCTCGAAA GAGAAATTAA CGATTGGGGG ATAACAATGA CTAAAATTAC CGTTTTTGGT 3480

ATGGGAAGTT TTGGGACAGC CCTTGCCAAT GTTCTTGCAG AAAATGGACA TGATGTTTTG 3540

ATGTGGGGTA AAAATCAAGA TGCTGTTGAT GAATTAAATA CATGTCATAC AAATAAAAAG 3600

TATTTAAAAT ACGCGAAATT A 3621

(2) INFORMATION FOR SEQ ID NO: 359:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 643 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359:

CTTTCGGAAA TTAGGATCnG nGCTATCTTG GCCCCAATTA CCAAGGGAAC TAnTGGCACC 60
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10 



AATGCtTCTT TCATCTCCAT GCCCTGTTOC TCATTATTAA TAACACGGTC TATTAACACA 180

ATGGCATTTG TTACTACGAT TCCAATTAAC ATTAGCATAC CAATTAAACT TGGTACTGAT 240

ATTOTTTCTC CTGTGATTAA TAGTGCAATA ATTACACCGA TAACTGTAAA TGGTAAAGAG 300 

AATAAAATTG TAAATGGTGC TAGGCCACCT TTAAATGTAA TAACTAGGAT TAAATATACG 360 

ATAATGATTG CAGCTAACAT TGCAAAGGCT AATTGTGTCA TTGCATTGTT AATATCATCT 420 

GATGCACCAC CGATATTAAC CTTTACATTA TTCGGTTTAT CCAAATTATT TATTTTAGAC 480 

ATCACTTGTC GTGTTGTGCC ACCCACATCT TTATTTGTTA CTTTAGCAGA TACCGTCGTT 540 

GCATAATCTC CTTGTTCTTG CGTCAATTTA CTTGGTGTCG TTGTTTTAAC TAACGTAGCG 600 

ATATCTCCCA ATTTAATCGT ACCACCAGTC GGCTTTTTCA AAG 643 
(2) INFORMATION FOR SEQ ID NO: 360: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2524 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: 

TTTCAGGCyG TGCAAGGGCT TTTTCTTTTG CTTTTAAGAT TATGATTTAT CGTGCAAAGT 60 

TAAGTGGTCG TATATAGTTT TAGTTTTAAA AAGGTAATTA AATAAAATAG TTTGCCGAGG 120 

GAGATGTCAA AATGATTAAA ATACCTAGAG GGACGCAGGA TATTTTACCT GAAGATTCAA 180 

AGAAATGGCG TTACATTGAA AATCAATTAG ATGAATTAAT GACATTTTAT AATTATAAAG 240 

AAATAAGAAC ACCAATTTTT GAAAGTACAG ATCTTTTTGC AAGAGGTGTT GGTGATTCAA 300 

CCGATGTCGT ACAAAAAGAA ATGTATACAT TTAAAGATAA AGGCGATAGA AGTATTACAT 360 

TAAGACCTGA GGGAACAGCT GCAGTTGTGC GTTCATATAT TGAACATAAA ATGCAAGGTA 420

ATCCAAACCA ACCAATTAAA CTTTATTACA ATGGACCGAT GTTTAGATAT GAACGTAAGC 480 

AAAAAGGACG CTATCGTCAA TTTAATCAAT TTGGTGTAGA AGCTATTGGT GCTGAAAATC 540

CTAGCGTAGA TGCAGAAGTA TTAGCTATGG TTATGCATAT TTATCAATCA TTTGGATTAA 600 

AACATTTAAA GCTTGTTATT AATAGTGTAG GGGATATGGC GTCTCGAAAA GAATATAACG 660 

AAGCGTTAGT GAaACACTTT GAACCAGTAA TTCATGAATT TTGTTCAGAT TGTCAATCAC 720 

GTTTGCATAC AAATCCGATG CGAATTTTGG ATTGTAAAGT AGACCGTGAT AAAGAAGCGA 780 

TTAAGACTGC ACCTAGAATC ACTGATTTCT TAAATGAGGA ATCTAAGGCA TATTATGAAC 840 

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GTGGATTGGA TTATTATACA CATACAG CAT TTGAATTAAT GATGGATAAC CCTAACTATG 960 

ATGGTGcCAT TACAACGCTT TGTGGTGGTG GCCGTTATAA TGGTTTATTA GAATTGCTAG 1020 

ATGGTCCAAG TGAAACAGGT ATTGGTTTTG CGCTAAGTAT AGAACGATTA TTGCTTGCAC 1080 

TTGAAGAAGA AGGTATCGAA TTAGATATTG AAGAAAACTT AGATTTATTC ATTGTTACAA 1140 

TGGGTGATCA AGCAGATCGA TATGCTGTGA AGCTATTAAA TCATTTGAGA CATAATGGTA 1200 

TTAAAGCAGA TAAAGACTAT TTACAGCGTA AAATTAAAGG ACAAATGAAA CAAGCAGACC 1260 

GTTTAGGTGC CAAGTTTACA ATCGTTATTG GTGATCAAGA ATTAGAAAAT AATAAAATCG 1320 

ATGTTAAAAA TATGACAACT GGTGAATCTG AAACAATTGA ATTAOACGCA TTAGTCGAAT 1380 

ATTTTAAGAA GTAGAGAGGG CGTTAAAATA TGAGTAAGAG AACAACTTAT TGTGGATTAG 1440 

TTACTGAGGC ATTTTTAGGA CAAGAAATTA CATTAAAAGG ATGGGTTAAC AATCGTCGTG 1500 

ACCTTGGTGG ATtGATTTtC GTTGATTtAA GAGATAGAGA AGGAATTGTA CmAGTCGTGT 1560

TTAATCCTGC ATTTTCAGAA GAGGCaTTGA AAATTGCTGA AACAGTACGT TCTGAATATG 1620

TTGTAGAAGT TCAAGGTACA GTTACGAAGC GTGACCcTGA AACAGTTAAT CCTAAAATTA 1680

AAACTGGCCA AGTTGAAGTA CAAGTTACAA ATATTAAAGT GATTAATAAA TCTGAGACAC 1740

CACCATTTTC TATAAATGAA GAAAATGTTA ACGTTGATGA AAATATTCGA TTAAAATACC 1800 

GTTATTTAGA TTTACGTCGT CAAGAGTTAG CGCAAACATT TAAAATGAGA CATCAAATTA 1860 

CACGTTCTAT TCGTCAATAT TTGGATGATG AAGGGTTCTT TGACATCGAA ACACCAGTAC 1920 

TAACGAAGTC AACACCTGAG GGTGCACGTG ACTATTTAGT ACCATCTCGT GTTCATGATG 1980 

GTGAATTTTA TGCATTACCA CAATCACCAC AATTATTTAA GCAATTATTG ATGATTAGTG 2040 

GATTTGACAA ATACTACCAA ATCGTAAAAT GCTTCCGTGA CGAAGATTTA CGTGCAGATC 2100 

GTCAACCTGA ATTTACACAA GTCGATATTG AAATGAGTTT TGTAGACCAA GAAGATGTGA 2160 

TGCAAATGGG TGAAGAAATG CTTAAAAAAG TTGTTAAAGA AGTTAAAGGC GTTGAAATTA 2220

ATGGCGCTTT CCCACGCATG ACATATAAAG AAGCGATGCG TCGCTATGGT TCTGATAAAC 2280 

CAGATACACG TTTTGAAATG GAATTAATTG ACGTTTCTCA ATTAGGACGT GATATGGACT 2340

TTAAAGTATT TAAAGATACT GTTGAAAATG ATGGTGAAAT TAAAGCAATT GTCGCTAAAG 2400

GTGCAGCTGa ACAATATACT CGTAAAGaTA tGGGaTGCTT TAACAGAATT TGTaAACaTC 2460 

ymTGGtGCTA AgGtTAGCGT GGGGTTAAAG TTGTGGGAAG GTGGTTTTGA CAAGGTCCCA 2520 

ATGG 2524
(2) INFORMATION FOR SEQ ID NO: 361:
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<A> LENGTH £ 1507 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361:



TCGTTGAGTA AAAGTCCAGA AAATTGGATG AGTAAACTTG ATGATGGAAA ACATTTAACT 60

GAGATTAATA TACCGGGTTC ACATGATAGT GGCTCATTCA CTTTAAAGGA TCCAGTAAAA 120

TCAGTTTGGG CAAAGACTCA AGATAAAGAT TACCTTACCC AAATGAAGTC GGGAGTCAGG 180

ttttttgata TTAGAGGTAG AGCAAGTGCT GATAATATGA TTTCAGTTCA TCACGGCATG 240

GTTTATTTGC ATCATGAATT AGGAAAATTT CTCGATGATG CTAAATATTA CTTGAGTGCT 300

TATCCAAACG AAACAATTGT GATGTCTATG AAAAAGGACT ACGATAGCGA TTCTAAAGTT 360

ACGAAGACAT TTGAAGAAaT TTTTAGAGAA TATTATTATA ATAACCCGCA ATATCAGAAT 420

CTTTTTtACA CAGGAAGTAA TGCGAATCCT ACTTTAAAAG AAACGAAAGG TAAAATTGTC 480

CTATTCAATA GAATGGGGGG TACGTACATA AAAAGTGGTT ATGGTGCTGA CACGTCAGGT 540

ATTCAATGGG CAGACAATGC GACATTTGAA ACGAAAATTA ATAATGGTAG CTTAAATTTA 600

AAAGTACAAG ATGAGTATAA AGATTACTAT GATAAAAAAG TTGAAGCTGT TAAAAATTTA 660

TTGGCTAAAG CTAAAACGGA TAGTAACAAA GACAATGTAT ATGTGAATTT CTTGAGTGTA 720

GCGTCTGGAG GCAGCGCATT TAATAGTACT TATAACTATG CATCACATAT AAATCCTGAA 780

ATTGCAAAAA CGATTAAAGC AAATGGGAAA GCTAGAACGG GTTGGCTGAT TGTTGACTAT 840

GCAGGATATA CGTGGCCTGG ATATGATGaT ATCGTAAGTG AAATTATAGA TAGTAATAAA 900

TAAGGATTCA ATAATGATAT TAAGACGAGT ATGAAAATAG TTAGATTCTA ATTATTTTCA 960

CTACTCGTTT TTATTTTGAA AATAAGTAAT AATTCAACAA TATTATAAAT TGAACAGATT 1020

GTTTGTGAAA TTTTTGATAA TATTAAAGTG AAAAAGTGTT ATAAATTGAT AAATATATGT 1080

AATTAACAAA AACAAATCAT TTTAAAAAGA AGAGAGTTGT AAGATGATGa AACGATTAAA 1140

CAAATTAGTG TTAGGCATTA TTTTTCTGTT TTTAGTCATT AGTATCACTG CTGGTTGTGG 1200

CATAGGTAAA GAAGCGGAAG TTAAGAAAAG CTTTGAAAAA ACATTGAGTA TGTACCCTAT 1260

TAAAAATCTA GAGGATTTAT ACGATAAGGA AGGCTATCGT GATGATCAGT TTGATAAAAA 1320

TGATAAAGGT ACATGGATTA TAAATTCTGA AATGGTTATT CAACCTAATA ATGAAGATAT 1380

GGTAGCTAAA GGCATGGTTC TATATATGAA TAGAAATACC AAAACAACAA ATGGTTACTA 1440

mTATGTCGAT GTGACTAAGG ACGAGGATGA AGGAAAACCG CACGACAATG AAAAAAGATA 1500
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(2) INFORMATION FOR SEQ ID NO: 362: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1216 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362: 





TACATGTTTC GGATGCTACT TTATTTAGTT TGAAGGGTGC ATTATGGACG TTAGCGCAAG 60

AAGTTTATCA AGAATGGTAT TTAGGATCGA AGTTGTATGA AGATGTTGAA AAGAAAATAG 120

CACGAACTAC TTTTAAGACA GGTTATATTT ATCAAGAAAT TATTTTGAGA CCAGTAGATG 180

AAGTTAAGGT ACTTCTGAAT GATTTAAAAG GTGCTGGTTT CGAATTAGGT ATTGCAACAG 240

GTCGTCCTTA TACTGAGACT GTTGTGCCAT TTGAAAATTT AGGATTGTTA CCATATTTTG 300

AAGCTGATTT TATTGCAACA GCAAGTGATG TTTTAGAAGC AGAGAATATG TATCCGCAAG 360

CACGACCATT AGGAAAGCCG AATCCTTTTA GTTATATCGC AGCTTTATAT GGTAATAATC 420

GCGATAAATA TGAATCTTAT ATCAATAAGC AAGATAACAT TGTAAATAAA GATGACGTAT 480

TTATAGTAGG CGATTCGTTA GCTGACTTAT TAAGTGCTCA AAAAATAGGT GCAACGTTTA 540

TTGGAACATT AACAGGTTTA AAAGGTAAGG ATGCTGCAGG TGAGTTAGAA GCGCATCATG 600

CCGACTATGT TATTAATCAT TTAGGTGAAC TTAGAGGTGT ACTAGATAAT TTGTAATTTG 660

ATTGTTGTTT GACAGCATAA CTTGTAGTGA ATGATTGAAC CAAAGGTTTC ATATTGAGTT 720

ACAATGAAAT TAATAATGAA AAAATGCCAA GAAGCAATGG AAGTAATCCA ATGTCTTCTT 780

GGCATTTTGA ATTTACATAA ATTGTTTATG ACTGTACCGT CAATTCAGTT GTGAAAATTT 840

GATTGTATTC ACCAACTTGT TTAAGTTCAT CAATTATATT GTTTGAAACA GGTTGATCAA 900

CGGATAAAAT CATTAGCGCA TCTCCGCCCG CTTCAGTTCT ACCTAAAGTC ATAGATGCAA 960

TGTTGATATT GTATTTACCT AACAATGCGC CAGTTTTTCC TACCATACCT GGAGTATCAT 1020

TATGATATGA CACAATTTGA TATTGATTTG GCTTTAAGTC TACAGAAAAA TTATTAATTC 1080

TAACAATTCT TGGACCGAAA CCTGTAAAGA CAGAAGCGCC AACTTTAACG GAATCGCnAT 1140

CGnTTGATAG TTCTACCTCT AAGTAGTTAC TAAAACCTGT CTCTGCTTTA TTATTTTCAA 1200

TATTTAATGT CACTTG 1216

(2) INFORMATION FOR SEQ ID NO: 363: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 867 base pairs 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363:

ATCAAAATAT TTTAAATAAT ATTGATGGTC ACATTGTAAA TrmAATAGAA AATAAATTTG 60

ATCAAATATT ACAAGAACCA TTAAATCCAT TAAATTATGA TACTGTCAGT GGATTAGCTG 120

GGATAGGGAG ATATTTGCTA AATAGAGTAG ATGAGAATGA ATTTAATGTT AAAGCATTAA 180

AAAGCATATT AGTATACTTT AAAGATATTC AATATTCTAA AAATAGCTGG GTAGTCCCAC 240

AAGArAGTCA ATTTTTAGAG TCTGATAAAA ATTATTTTAC TGAAGGTAAT ATCAATCTTG 300

GCCTTGCACA TGGaGTGCTA GGACCGATGT CTTTATTTGC ATTATGCGTG ATTAAAGGAA 360

i lACXSATTvSA AAGACATGTA CAAATTTATC ATGGACGAAA 420

AATTTTGTAA CCACGAAAGA TGGTTGCAGC GTTACGATTT AATTTCTGAA COTAATCATT 480

TCAATTTTAT TCGGAATGGT TGGTGTTATG GCAATACGGG TGTAATGACG ACGTTGTTTT 540

TAATCGGCCA AGCATTACAA GATGATGAAA TAATTAAAAT GTCTAAAAAA GTGATGCTAC 600

AAGTAGTAAA TGATAAAGAT GAAAATTTAA TAAGTCCAAC TATTTGTCAT GGATTGTCAT 660

CACAAATATT AATGTTAACA ATTATGAATT TGAATTTTGA ATTAAATGAA GTGTCTGATT 720

ATATCACTGT ATTAATAAAT AAACTGATTT CTCATTATAA GGAAGATTAT CTGGTGAATT 780

TTATAGACAT TAATGAAAAT AAGCAAGATG TATTTAAAAG TAGGAAAGTT GGCCTTTTAG 840

AAGGTGAATT AGGGGTCATC TAACATT 867



(2) INFORMATION FOR SEQ ID NO: 364:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10813 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364:

TACCTTTTCT TtTAAATCAT TTTATATTTT CCCaCTAATA TCCGcTGtTA ATCaATCCtG 60

ACATCCtTGT ATCaCTATGA CAATTAATTG TTAAATACAT GAATTTCTAC ATTTTATGAA 120

AAAATCCATT TTTATTACAA TTCAACACTT TATATGACAA CTTCATTACA GTTACTTTTA 180

TTGTTGATTG CTTACATTGT TTTCTAAAAA AAATTTGTTA TCATAATTAA CGTTGAATAA 240

AGAAAAAAAT TAACTTGGGA GATAAAAATG GAATATAAAA AGATACTAAT TCGTTTATTA 300
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TO 



CAATCGCATG CCGCAGTTAA TTATTATAGT AAAAACCAAT GTACATGGTG GGCATTTAAA 420 

CGTCGCGCAC AAGTCGGTAA ACCTGTTTCT AATAGATGGG GCAATGCTAA AAATTGGTAT 480

TACAATGCAC GTAAATCAAA ATATGCGACT GGTCGTACAC CAAGAAAATT TGCTGTCATG 540 

CAATCAACTG CAGGATATTA TGGACATGTC GCAGTTGTTG AACAAGTATA TAAAAACGGT 600 

AGTATTAAAG TTTCAGAATA CAACTTTTAT CGCCCATTAA AATACAATAC ACGTGTACTA 660 

AGCAAAAAGG CAGCACGTAA CTTTAACTAT ATTTACTAAT CAAAAAACTT CTATCACGAA 720 

CGCTTCAATT TCCTGTATGC GTGTGATAGA AGTTTTTATT TTATGAAATT ATATTATTAC 780 

TTCTACAAAT TTCAAATTGC CGTAATTGAA CGTATATTTC TTCTTCAACT ATTATTTCAT 840 

CTTTAGCATA ATCTATATAT AAAATTTTAT GCTATTATTT AAATAATTCG CTATAACTTA 900 

ACATACGTTT TCGATATAAA CCTTGTTCTA AATCTCAATA ATTTTTTGCT GTTTTCATCG 960 

TCATTAGTTA AAAAAATAAT TTAACTGAGT TTTTGAGACC TGTTTAAATT GATGACTCAA 1020

AACCTTTATC CCCTTTTCAC TCGGTTTAAT TGCTTTAATA TTTAACACAG TCTCATTGTA 1080

ATTTTGCTAC TAAGTTTGAA ATATTTCGAT TCGAATGTGA AcATCATTTA TTATTACTTT 1140

CGACAGCTAG AAAAATATTG TTAAACCAAA ACTATAATTA CCACTTTTAT ATAGAACTAT 1200

ATATAAATTT TAACTTTGAG ACAGAACTGC TAGTCAGGTT TATGAATATA TTTCTTTAGT 1260 

TTACTTGaTA TACTTATTGG TAAATCATTA TTTATTTGAA GAGCATTTAT AACAAAAAGT 1320 

ATAATCCCAG TTATAGCGAT TCCATGACCA TTTTTATCAG TCAAAGATTC ATCAACAAAA 1380

ACTTTTTGAT AAATTTATAT ATTTGTATAA TTTATTATGG TAGATATCTA CACCCGTATC 1440 

TATAACAGCT ACAGTTATCA TTGCGTCTTA TCCAAATAGT TTTTAAGAAA TAAATATAAT 1500 

TCATTAATAT CATGGCTATT TGTAACGTCA GtGCTTaAGA CCTTGTCAAT TATTGTTTTA 1560

TCTGTTTCTA AGAGCCCCTG ATTTTCTGCA TACATATTTT GTATATCGCT GTTAATCTCA 1620

TTTAATTTTT TAAATTTTTT TCTAACCTTT TTCAAAATAA AATGCTCTTT AATTCCATCT 1680 

CTTAAATACA TATAAATCTC TATTATAATC TCCATAAATG TCATATCTCC TCTCTCAAAA 1740 

AACATTATAC TCAATATATT AAAAATAAGA AAGGTTTTTT CAGAATTTTT AGTATTTTCA 1800

GATATTTATT CTTAAGTATC TTATTATTAC TTCTTTTTAA CGCTAGTGGa ATAATTAATA 1860

ATAATTGTAT TTAAATGCAA TGCTGTTGTT CTTTCATACT TACAAGCAAG TCATAAGAAA 1920 

TCAGAAATTA AATTCTTATA TGATGAATAA ATGATGACTG CAGAAGAAAT TGCTAAAAAA 1980 

GGAGTGGAGT GAGGCATTCT ACTGTTCATA TTGTTAACAA ATATCAAAAT GATAGGAACT 2040 

TGAAGAATCT AAGACTGTTA AAATCGAGAC AAGAAGAAAC TCGTTCCAAA TTTGAATCCT 2100 



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w 



AAAACTTCTA TCACGAACGC TTCAATTTCC TGTATGCGTG TGATAGAAGT TTTTATTTTT 2220 

GTTCATATTA ATTTATCTAA GCGCTACGAT GGaCTGACTT AAACTTTTCT TTAAAATCGC 2280 

TATTOGCCAT TTACTATTGT TGTCTAATTT CTTGTAAAAT ATGTTCCGCT GCTTGTGTAT 2340 

TTGCACGGGG TTCTTTTTTC AAAGCTTCAG CTACTTTAGC AATTTCATCA CCTTTTGCCC 2400 

CTACAACGAT AGCTAATGAT TTATATTGTA AACTCATATG ACCTTGTTGA ATACCTTCTG 2460 

ACACAAGCGC GCGACATGCT GCAAAGTTTT GCGCTAAACC AACGGCAGCA ACTACATGAC 2520 

CTAATTCTTG TGCTGACTCT ACATTTAGTA GCTCTAATGA AGCTTTAGCA ATTGGTAATA 2580 

CTTTTGTACC ACCGCCAACG ATTGCGAATG TCATAGGCAC TTCAATTGTA CCAATCAATC 2640

GTTGACGATC TTGATCGTAA CGCCATGTAG CAATACCACG ATACTGTCCG TCACGACTCG 2700

CGTATGCATG CGCACTTGCT TCTGCACCAC GCGTATCATT TCCTGTTGCT AAAACAACAG 2760

CATGTATGCC ATTCATAACA CCTTTATTAT GTGTTGCTGC ACGATGAATA TCTACTTGGG 2820

CCAATACAGA AGCACGTTCC ATTCGTTTGG CAACCTCTTC TCCAGTTCTC TCGCCCCTTG 2880 

CTAAATCTTT AACATCAATT TCGCCTTGAA CTTTAACAAC GGACGCTGTT GCATGATTGG 2940 

ATAAAATACT CATTAAAATG TCGCTTTGCG GAAATTCATT TTTTAAAAAT GCAGTTATGG 3000 

CCTCTAAAAT CGTATTAAGC ATATTAGCGC CCATAGCATC TTTCGTATCA ACAAATACTT 3060 

TTAAAGATAG TAACTGTTGC TCAGGAAATG TATCAATCGC TATACGTTGG TAACCACCAC 3120 

CACGCGCTTT AATAGAAGGA TATGCCTCAT CCGCAATTTT ATGAATTTGC TTTTCTAAAG 3180 

CTTTAATGTC TGCTGATAAT TTTTCAGTAT CGTCAACGCC ATCAAAGACG ATTTGACCTA 3240 

TCATAATACG TTCAGAAGAT ACCGTTTTAA ATCCGCCAGT CTGATTCACT AGCTTTGCAC 3300 
CATAACTAGC TGCAGCGACA ACTGAAGGCT CTTCCACCAT CATAGGTACA ACATATGCCT 3360

TATCGTCCAC AATGATATTC GGTAATAATC CAACGGGTAA TGCACCTTGC GCGATGACAT 3420 

TTTCAATTAA ACTATTGGCT ACTTCTTCAT CGATTAATGG ATGATTCAGT AAAATGTCGA 3480

ATTGTTCTTC TGATAACCAT TGCTTATCAA CCAATTGTTG TAACTTTTCT TTACGAGATA 3540

AATGTCGAAA ATTCTTATCT AAATTTTGCA TGGACGTACT CCTTTTACTT CACATAATTT 3600

TTAACATTTC AATCACTACT ATTTTTACCA CAAAATAACG TCATTCGTCT TAAAATTCAA 3660

TTGAATAATT GTCGTTTTGA CTTTAAAATA AAACAAGGTA AATTAAAACG CTTACAAGAA 3720 

ACGACAAATc ATTTTTAAAT TTAGTATATT TCTTTGTATA AAATTAGCAT ATTCTGATAT 3780 

GATACAAGTG TTGCTTTTAT AAATTTGAAA GGATGTAAAA CCTTATGACA ATAGGTATCG 3840 

ATAAAATAAA CTTTTACGTT CCAAAGTACT ATGTAGACAT GGCTAAATTA GCAGAAGCAC 3900 

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10 



CTGTAAACCA AGACATCGTT TCAATGGGCG CTAACGCTGC TAAGGACATT ATAACAGACG 4020 

AAGACAAAAA GAAAATTGGT ATGGTAATTG TGGCAACTGA ATCAGCAGTT GATGCTGCTA 4080 

AAGCAGCCGC TGTTCAAATT CACAACTTAT TAGGTATTCA ACCTTTTGCA CGCTGCTTTG 4140 

AAATGAAAGA AGCTTQTTAT GCTGCAACAC CAGCAATTCA ATTAGCTAAA GATTATTTAG 4200

CAACTAGACC GAATGAAAAA GTATTAGTTA TTGCTACAGA TACAGCACGT TATGGATTGA 4260

ATTCAGGCGG CGAGCCAACA CAAGGTGcTG GCGCAGTTGC GATGGTTATT GCACATAATC 4320

CAAGCATTTT GGCATTAAAT GAAGATGCTG TTGCTTACAC TGAAGACGTT TATGATTTCT 4380 

GGCGTCCAAC TGGACATAAA TATCCATTAG TTGATGGTGC ATTATCTAAA GATGCTTATA 4440 

TCCGCTCATT CCAACAAAGC TGGAATGAAT ACGCAAAACG TCAAGGTAAG TCGCTAGCTG 4500 

ACTTCGCATC TCTATGCTTC CATGTTCCAT TTACAAAAAT GGGTAAAAAG GCATTAGAGT 4560 

CAATCATTGA TAACGCTGAT GAAACAACTC AAGAGCGTTT ACGTTCAGGA TATGAAGATG 4620

CTGTAGATTA TAACCGTTAT GTCGGTAATA TTTATACTGG ATCATTATAT TTAAGCCTAA 4680

TATCATTACT TGAAAATCGA GATTTACAAG CTGGTGAAAC AATCGGTTTA TTGAGTTATG 4740

GCTCAGGTTC AGTTGGTGAA TTTTATAGTG CGACATTAGT TGAAGGCTAC AAAGATCATT 4800

TAGATCAAGC TGCACATAAA GCATTATTAA ATAACCGTAC TGAAGTATCT GTTGATGCAT 4860

ATGAAACATT CTTCAAACGT TTTGATGACG TTGAATTTGA CGAAGAACAA GATGCTGTTC 4920

ATGAAGATCG TCATATTTTC TACTTATCAA ATATTGAAAA TAACGTTCGT GAATATCACA 4980

GACCAGAGTA GTCGGTGTAT TTAAAACACA TATAATAAAA CCTAAAAGCA GCAGTAAGAC 5040 

CACTTCTAAT TGAAATCGTC TTACTGCTGT TCTCTATTTA TAACACTTCG TATTGAATGA 5100 

ATTCATTATG CCTATTTGAC ACATTATTGA AGTTTTCCTA ATGCCTGGAT CCTTTATACG 5160

TTACGGCTTC GTGCTATGTT TTGGTACATA AAGCTTTGAC ATATCGATAT TCTCCAACTC 5220

TAACAGCTTA ATTTTATTAT TAATCGTTCC ACCGAACCCT GTTAAGCTAC CCGTTTTACC 5280

GACAACACGA TGACATGGCA CGATAATAGA TAATGGATTA CTTCCGACTG CACCTCCAAC 5340 

CGCTTGGGCT GACATTTTTG GCTTGTTAAG CAGCTTGCCT ACTTTTTTGG CAATAGCACC 5400 

ATACGTTGTT AGAGTCCCAT AAGGAACCTG TCTTAATTCA TTCCAAACAC ACTGTTGAAA 5460

ATGACTACCT GTTGGCTTTA AAGGTATTGT GATTTCAGGA TTGTCACCTT TAAAATACGC 5520

GTCTAACCAC TGTGTCGCCT CTCTAAATAT CGCTAAAGAC GTATTTTCTT CCCTAGTACC 5580

ATCACCTTGT TGATTTTCAA ACAAAACAGC GGTCAGACTT ACCCCATCAC TCAAAAGTTC 5640

CAATCGTCCT ACAGGCGAAT CATAGTAACT CTTATACTCC ATAAAAATTC CCCCTTTTTC 5700
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ATAAGTCGTC AATTAGGTAT ATAAACACGT AATACCAGCT ATCACTTTGC TGCAATATAC 5820

AGTTACATAT CTTACTACAC GTGCTAACCT CTTACTTTGT AAACCAAATC TTAAATTAAA 5880

ATATTOAAAA TGCAATGAAT CCTTAATATT TTATTAAACC TATAATTACT TATTAAAAAT 5940

AACACACAAT ATTCATAAAG TTTTAAAAAT ATTCTGTTTT ATCACCTACT ATTAGTGGAA 6000

AAGTACAATT GCAATTGTAT ATAGTTTGCA TAACGCTTCA AAAGTAATTT CTTTTTTGTT 6060

TAGTTCAAAA AAATTTAGAG GTGATGTTAT ATGAATAACG GTTTTTTCAA TAGCGACTTT 6120

GATTCAATTT TTCGAAGAAT GATGAAAGAT ATGCAAGGTT CAAATCAAGT CGGAAACAAA 6180

AAGTACTATA TTAATGGTAA AGAAGTTTCA CCTGAAGAAC TAGCGCAACT CACACAACAA 6240

GGTGGCAATC ACTCTGCTGA ACAAAGTGCG CAAgcTTTTC AACAAGCAGC ACAAAGACAA 6300

CAAGGGCAAC AAGGTGGCAA CGGCAATTAT TTAGAACAAA TTGGTOGTAA CCTTACGCAA 6360

GAAGCACGTG ACGGTTTATT AGATCCAGTC ATTGGTCGTG ATAAAGAAAT TCAAGAAACT 6420

GCTGAAGTTT TAAGTAGACG AACTAAAAAC AATCCTATAT TAGTTGGAGA AGCTGGTGTT 6480

GGTAAAACTG CGATTGTTGA AGGTTTAGCA CAGGCAATCG TTGAAGGAAA TGTACCAGCA 6540

GCAATCAAAG ACAAAGAAAT TATTTCTGTA GACATTTCAT CATTAGAAGC TGGAACGCAA 6600

TATCGTGGTG CTTTTGAAGA AAATATTCAA AAATTAATCG AAGGTGTTAA ATCTTCACAA 6660

AATGCCGTAC TATTCTTTGA TGAAATCCAT CAAATTATCG GTTCAGGTGC CACAGGAAGT 6720

GATTCAGGTA GCAAAGGGTT ATCTGATATT TTGAAACCTG CATTAAGTCG TGGTGAGATT 6780

TCTATTATTG GTGCAACAAC ACAAGATGAA TATCGAAACA ATATTCTTAA AGATGCTGCA 6840

TTAACGCGCA GATTTAATGA agtgcttutt AATGAACCAA GCGCTAAAGA TACTGTTGAA 6900

ATTTTAAAAG GTATTCGCGA AAAATTCGAA GAACACCATC AAGTAAAATT ACCAGATGAC 6960

GTATTAAAAG CATGTGTTGA CTTATCAATT CAATATATTC CACAACGATT ATTACCAGAT 7020

AAAGCAATCG ATGTGTTAGA TATTACAGCA GCACATTTAT CTGCGCAAAG TCCAGCTGTC 7080

GATAAAGTTG AAACTGAAAA AOGAATTTCT GAATTAGAAA ATGATAAACG TAAAGCAGTA 7140

AGTGCTGAAG AATATAAAAA AGCTGACGAC ATTCAAAATG AAATCAAATC ATTACAAGAT 7200

AAATTAGAAA ATAGTAATGG TGAACATACT GCTGTTGCTA CAGTTCATGA TATTTCAGAT 7260

ACTATTCAAC GATTAACTGG TATTCCAGTT TCTCAAATGG ATGATAACGA TATTGAACGT 7320

TTAAAAAATA TTTCTAATCG TTTAAGAAGT AAAATCATAG GTCAAGATCA AGCTGTAGAA 7380

ATGGTTTCAC GTGCAATTCG CCGTAATCGT GCTGGGTTTG ATGACGGCAA CCGTCCAATT 7440

GGCAGTTTCC TATTTGTTGG CCCTACTGGT GTTGGTAAAA CAGAGCTTGC TAAACAATTA 7500








OACACAACAG CTGTTTCAAA AATGATTGGT ACAACTGCTG GTTATGTTGG TTATGATGAC 7620

AATTCAAATA CGTTAACTGA AAAAGTACGC CGTAATCCAT ACTCAGTCAT TCTATTTGAT 7680

GAAATCGAAA AAOCAAATCC ACAAATTTTA ACATTGTTAT TACAAGTAAT GGATGATGGT 7740

AATTTGACTG ATGGTCAAGG TAATGTCATC AACTTTAAAA ATACAATTAT TATTTGTACA 7800

TCAAATGCTG GCTTTGGCAA TGGCAATGAC GCTGAAGAAA AAGATATTAT GCACGAAATG 7860

AAAAAATTCT TCCGCCCTGA ATTCCTTAAC CGCTTCAACG GCATCGTTGA ATTCTTACAT 7920

TTAGATAAAG ATGCATTGCA AGATATCGTC AACTTATTAT TAGACGATGT ACAAGTTACA 7980

TTAGACAAAA AAGGTATTAC GATGGACGTT TCTCAAGATG CGAAAGATTG GTTAATTGAA 8040

GAAGGCTATG ATGAAGAATT AGGTGCACGT CCATTAAGAC GTATTGTTGA ACAGCAAGTA 8100

CGTGACAAAA TTACAGATTA CTATTTAGAT CATACAGACG TTAAACATGT GGATATAGAT 8160

GTTGAGGATA ACGAATTAGT CGTAAAAGGT AAATAACGAC ACTTTAACAT ATCGCGCATC 8220

AAAAATGAGC ATCAGGTCGC CCTTGCCTGT GCTCATTTTT TTAATTATTT CCCTGGAAAA 8280

TGATTCGCTG TGTGCTGTTC TGTTCCACAA CAATCACGAT TAATGTCACA TGTACCACAT 8340

TTTCCTTGTT TTGAACGCTT GAAAAATTTT ACTAGTGTAT ATAAGGCATA TCCGAAAATT 8400

GCTAAAAAAA TTAAAATGTT AATAATGACT GACACTTTAA CCACTCCTTA AACAAATAAA 8460

TGTCCGACTT GATAAAAAAT GAATGTTAAG ACATATGCAG TGACTAGAGG ATAGGCAACT 8520

GCAAGTGCCG TCCATTTCCA TGAATAAGTC TCTTTACGGA TTGCTGCTAC TGTAGAAACA 8580

CAAGGAATAT ACAATAGTAT AAATATCATA AATGCATACG CAGATAGGGG TGTGAATTGA 8640

TTTTGAATCA CATTAACAAG GCCTGCATCA CCTGATGAAT AGATAATCGC CATCGAACTT 8700

ACGATAACTT CTTTTGCTAA AAATCCTGGC AcTAACGTAG CACCTGCTTG CCATGTTCCA 8760

AATCCGAGCG GTTGCACTAA CATACCAAAG AAACTACCAA CCATATGTAA AAAACTTTGA 8820

TTGATATTCA CATTGATACC ATGTGGTCCT ACATAACTTA ATAGCCAAAT GACTACTGAG 8880

CCGCCAAAAA TAAATGTACC TGCTTTACGA ACAAAGCCCT TAGCCTTTTC CCAAGTACTA 8940

CGCCACAACG TTTTAATGGA AGGCACACGG TATGTTGGCA ATTCCACAAT AAAGATTGCA 9000

TTATCATTTT TTAAAATCGT CTTAGTAAGT ACTGTACTGA CTAAAAATGC CATAATAATA 9060

CCTAAAACAT ACAGGCTTAA TACTACTAAA GATTGATTCT CTTTGAAAAA GATACCTACG 9120

AACAACGCAT ACACTGGCAG TCTAGCAGAG CATOACATGA ATGGTGCAAT TAATATCGTT 9180

GTTAAACGCT CTTTTTCATT TTCAATACTG CGCGCAGCCA TAATACTCGG TACATTACAA 9240

CCAAATCCGA TAATCATTGG TATAAAAGAC TTCCCGCTTA AACCGAACGA TTCCATAATA 9300
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

AAAAAGAGCA CAACAATTTG TGGTACAAAG ACTAATACTG ATCCTACACC AGCAATAATG 9420 

CCATCTGTAA TTAAATCTTG TAAAAATGGT ATAACACCAA GATAATTCAT AATCGTCTTC 9480 

ACACTATCTG TAAATGTACC ACCTATAAAT GCATCGAGTT GATCCGACAA AGGTGTGCCA 9540 

ATCCATGTAA ATGTAGTTTG AAAGATCAAC CACATAATTG CTAGAAAGAT AGGCATCCCT 9600 

ATATATTTAT GTGTTAATAT CTTGTCTATT CTAGAGCTGA AATATTGCTT ATCTTCATCT 9660 

GGATAOGTTA CCACGTCTTG CAATAACGTC TCAATATAAT GATTGCGTAT ACGCTCCATC 9720 

TCTCGACGAA CAGATACAGC CCCTACTTGT TCAGCAACTT GATCACGTAA ACTCGACAAT 9780 

TTATTTACAA CCTCTGAATT AAGTTCGTTT GCAATTTGCA TGTTATTTAA TAAGAATTGA 9840

ATCGCAATAA ACCTAGCTTG ATACTTATCA TGAGATGTCT CTGTCATTAT TATTTGACAC 9900 

ATATTTTTAA TTGTCTCTTC AATCTTCTCA CCATAATTGA TTTTAAAATG CGGTTGATAC 9960 

CCTTCCCCTA GATGCTTTAT TTCGCCAAGT AAATATTTTG TTCCTTTGCC TGTACGTGCC 10020 

ACAACTGGAA AAATAGGTGT TTTTAACTTT TTCATCAATT TATGATAATC GATTTTTATC 10080 

CCGOGCTTTG TAGCTACATC AATCATATTT AATCCGATGT ATATTGGTTG ATTAAGTTCT 10140 

AACAATTGTA CTGTTAATTG CATATTTCTT TTTAGTTGAC TCGCATCAAC AATGTTAATG 10200 

ATTCCTGAAA ATGAATCGTT TAATAAATAG TCTGTCACTA CAGTTTCATC TTTAGAAATC 10260 

GGCGATAAAT CATATGTACC TGGTAAATCA ATTAATTGTC CTACATTTTC TTTAAGTTTC 10320 

CCTACTTTTT TCTCTACCGT TACGCCACTC CAGTTGCCTA TATATTCATA CGAACCAGTT 10380

AAAGCGTTAA ACAAAGATGT TTTACCAACA TTAGGATTTC CTAAAATACA ATAATTTTCC 10440 

ATTCGTCCGG CTCCTATTCT TCTAATGCAA TAGAACAAGC ATCGCAATGT CTAATACTTA 10500 

ACTGTTGTCC GTTTACTTCA ATAATACATG GCCCTTTAAA TAAACATTTT TGTTTAATCG 10560 

TTATGATAGC GTCATCTGTT AACCCAAAGG CACTTAGACG ATACAACATA TTCTCATTAG 10620

CAATATCCAT TCGCTTTATT TTATAAGCCT TATTCATTTC ACCATTTTTA ATGTTTAACA 10680 

TACTATTTCG CTCTCCTATT AGAAATAATA ATCATTATCA CTTAAAAATC ATAACCCTTA 10740 

AAATTGTAGC TCGCAATACT TTATTTAAAT AATTTTGATT TTTCATGTAA AATTTGTGAC 10800 

ATTGCAAAAA TGT 10813 
(2) INFORMATION FOR SEQ ID NO: 365: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6804 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365:

TGATGGATTA GCAGACATTT TACGAGCGAA TGGTTTCAAA GTGTTTGGTC CAAATAAGCA 60

AOCAGCTCAA ATCGAAGGCT CAAAATTATT TGCTAAAAAG ATAATGGAAA AATATAATAT 120

TCCAACTGCT GATTATAAAG AAGTTGAGCG AAAAAAGGAT GCTTTAACAT ATATTGAAAA 180

CTGTGAATTG CCCGTTGTTG TCAAGAAAGA TGGGTTAGCT GCTGGGAAAG GCGTTATTAT 240

TGCAGATACT ATTGAAGCAG CCAGAAGTGC TATTGAGATT ATGTATGGTG ATGAAGAAGA 300

AGGTACTGTT GTATTTGAAA CGTTTTTAGA AGGTGAAGAG TTCTCGCTAA TGACATTTGT 360

TAATGGTGAT TTAGCAGTAC CTTTCGACTG TATTGCACAA GATCATAAAC GCGCATTTGA 420

TCATGATOAA GGACCAAATA CTGGTGGTAT GGGGGCTTAT TGTCCmgTAC CACATATTAG 480

TGACGATGTT TTAAmACTTA CAAATGAAAC AATTGCACAw CCCATTGCAA AGGCAATGCT 540

TAATGAAGGT TATCAATTCT TCGGTGTATT ATACATTGGT GCTATTTTAA CTAAAGATGG 600

TCCAAAAGTA ATAGAATTTA ATGCCCGTTT TGGTGATCCT GAAGCTCAAG TATTATTAAG 660

TCGCATGGAA AGTGATTTAA TGCAGCATAT TATTGATTTA GATGAAGGAA AAOGTACTGA 720

ATTCAAATGG AAAAATGAAT CTATTGTAGG GGTCATGTTG GCATCAAAAG GATATCCTGA 780

TGCATATGAA AAAGGGCATA AAGTAAGTGG CTTTGATTTA AATGAAAACT ATTTTGTTAG 840

TGGATTAAAG AAGCAAGGTG ATACCTTTGT TACTTcAGGT GGTAGAGTTA TACTTGCCAT 900

CGGAAAAGGT GACAATGTAC AAGATGCACA GCGAGACGCA TACAAAAAAG TATCACAAAT 960

ACAAAGTGAC CATTTATTCT ATCGTCATGA CATTGCGAAT AAAGCACTAC AACTTAAATA 1020

AGTAAATTTA AAATACTAAG aTTAGCTATG AACGAATCTA TAACGATAGA TTTTTTCATA 1080

GCTTTTTTAG TTGTAGAGTC TAGGACATTG ATTTCTGTAC CAAATTTGTG ATTATGCATA 1140

TGTAATACAA AAGAGGCGCC ACAACATGTT TGGATGAACA AAATAACATG TTTGTGGCAC 1200

CTCTTTTGTT TAGTATGGAA TAAATGGTTT TCTTTTTCTA TACAATGAAT TTCTAATTTA 1260

GTATCTATAC AATTATGGAT AAAATTTAAC CTACACGACC AAGACGAACA TCATCTATGC 1320

CCX3TGATGGG TAAGGTGATT GAACAATAAT ATGCCATAGT AATAATGGCA ATTAAAACTA 1380

TAATAAAGAT TATATCTTTA TATGAGAAAG GTACGTTGTA ATAGTAAGTA CGAGGACCAT 1440

CTCTAAATCC TTTCGACTCC ATCGCAACTG ATAATTGATG TGCCTTTCTA ATATTTTGGC 1500

TTAATAGAGG TATAATTAAA TGCTTAAATC GCTTTAACCC TCTATAATTT GCCGCGTCTA 1560

TCATCTGATA GCGCATTTTT AAAGATCTGC GAAgcTGTAt TAAAGAACTA ATCATTAAAG 1620

GTATCATACG AATGGCAGCC ATGAATGCAT AAGCAACTTT TGATTTAACC TTTAAATGTT 1680
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ATGAAATAGC AATGGTTCTT AATGATACAT GTAAACCACG AACTAAACTT TCTGTTGTAA 1800

TATGGATAAA TCOGAATTTC AAAATTGTAT GGCTACCATT CCCGTATAAA ATCATGAACA 1860

GGGAAGAGAG TAATGCAAAG CCAATACTTA TAGTTATAAA AATTGCTGTA ATTTTAAACT 1920

GAGTACCATT AAACATCAAT AAGAAAACTA ACATTAAGAT AGTGATATAA AGCATAAAAT 1980

CGAAATTATG CACAAATATA ATAAAGAAAA ATAGTATAAT TCCAAGAAAT AGTTTCGTTA 2040

TAATGTTGAC ATCATCAACA AATGATTGCC GAACTTTCCA TTGCTCATAC A l iX-GTATCA 2100

CTAGTAACGC ACCATCTGAA ATTTTAAGTC TTCTTGATGG ATAACGTTCA 2160

CGTCATGTGT AaCCATGACA ATACTTTGTC CCAAATTAAT TCGCTTTTGG 2220

AAAAtil I IIjA TCAACTGGAA TGTATTATGG CTATCAAGTC CAAATGTCGG TTCATCTAAA 2280

AAGATAATAT CAGCTTTAGA ACTTAGTGCG GTAgcTACGC TAAGGCGTCG TTTTTGACCA 2340

ATAGACAACT CATAAGGATG TTGATCTTTT ACATTTTGTA AATCTAAAAG TTTTAAAAGT 2400

TGTATCGTTT CATCATCACT TTGATCTTTA GAAAGGTGAT TAAAATGAAT GTTAATTTCA 2460

TCATAAACCG AATTTGTTAT AAATTGTAAT TCTGGGTTTT GATAAACTAG GTACATGTGT 2520

TTTGCTGCAT GTTTAATTTT TGTTAAACGC TGATTTTCAA AATAAACATC ACCTTGATAT 2580

1 LAA.H GCATAATTGA TTCAAGCAAG GTTGTTTTAC CACTACCATT TGCCCCTGTA 2640

ACTCACCTAG ACCAATTTCT AAATCTGAGA ATGAGAGCAA TGTTGATTTA 2700

TACGTCCATT TTTAAATTGT AATAAGTGTG AGTTTGTTGT TGGAAAGTCA 2760

ACACGACTTG GTGCGAATTC CCATGCACGT GGATGCCACA CACCATATTC ACTGAGTAAA 2820

TGAACATACT TCTGTAATAT GATTTCAGGA CATTCATCGG CAATGATATT TCCGTTATAA 2880

TCCATCAAAA TGACGCGGTC GACATGATTC CAGATGTGTT TAACTTTATG TTGAACGATT 2940

ACAACCGTTT GATCTTCCCA AAGTTCAATT AGTTTAGTCC ATAAATCTTC TGTTGCTTGA 3000

ACATCTAACA TTGCTGTCGG TTCATCTAAA AACAATGTTT TTGATTGTTG AAGAATGGTT 3060

TCAACAATTG CCAATTTCTG TTTCATCCCG CCACTTAAAT CTTTGATATA CGTTTCAGGG 3120

GTAACATTTA AATTGACCAT ATTTAAAGCA TTGATAATTA ACGCATCCAT GTCTTCACGT 3180

GGTAATTGTC TATTTTCTAA AACGAATGCA AGTTCTTCGT ATACTTTTGG CATACAAAAC 3240

TGGCTATCAG GGTCTTGGAA AATAACGCCA CTTAATGGGT CAACGATTAG TTCATCATAT 3300

TTCATAGGTA ATTCAATTAA ATTAGGAACA ATACCACTTA ATACATTCAG AAGTGTACTT 3360

TTACCGCAAC CAGAAGGACC GAGTAAAAGT ACTTTTTCTT TGTCTTGAAT AGTGATATTT 3420

AAATGATCGA AAATTTTACG TTGACCACTT GGATATTTTA ATCGTAAATC ACTTACTTTT 3480
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

ATTTTGTTAC GCCTGTCTTA TCTAAAGCTT TTACTAAAAG GTAAGATAGG ACGCCGGCGA 3600 

CTACTGCACC ACTAATTAAT CTAAATAOGA TGAATAATGT TAAGTTCCAA CCTGCAACTT 3660 

CATTTAAATA ACCATAGAAA TAATCTATCG GGAAAGCCGC GATTOCTGTA CAAAAACCTG 3720 

CTAACATAGC TACCATAACT GAACGTGATT GATATTTAAA AATTGCAAAG ACAAGTTCAC 3780 

ACGCTAAACC TTGTATAAAA GCGTAAACGA TTGTCGGAAT ATCGAAACGA CCCATAATGA 3840

TAGTTTCGCC GGCACCTGCA GCAAATTCAG CCAGTAAAGC AATACCTGGT TTTGGAATAA 3900

TTAGATAGCA GACAATCGCT GCCATGAACC AAACCCCGTT TGTTAATTGT TCGAGGTGAA 3960

GGCCTGTAGC TTGCACACCA TTGTAAACAA ACCACCATAA ATTGTAAATA ACTGCGAATA 4020

CTACTGAAAT AAGTACGGTT ACTAGTATTT CAGATAGCTT TAAACCTTTT GACATTTTTA 4080

CATCCTCCTA ATAAAAAAAC GCACAACCAT CCATAGGAAA GTTATGCGTT CACAATATAT 4140

ATTAGTAAAA CATATGTATA GTAACACTTT CCTACGCTAG TTCAAGCTAG ATCAGGTTCA 4200

AAGGGTTTGA GGGCAAGCCT CATCTCAGTA TAAAACACCC CTAGTGTGTG CGATTTATTT 4260

AATTAATTAT ACTGTAAGAC GTTTGTAAAC TTATGTCAAT AGGTTGTCTT CATGAAATTT 4320

CGTTTAATTC GATTTAAAAT TTATAATATT AGCATTGGAT TTAAATTGAA GATGTAGTAG 4380

GAATGTTAGT AATTAAAGAT ATAAAAATAT GTGACATGTA ATAATATTGA GCTGATAAAT 4440

GAAGAGGGAT ACTTATCAAT CATACCTCTT TAACAACAGT GAAGAACCCG TGCATAATGG 4500

CTTACGAATT ATAGTTTATA AGGAAGAAGA GGGATACATG CGCCGAGCAC ATGCATAAAA 4560

GCCCCTAACA ACTAAAAGTT GTAAGGAAGG AGAGGGATAC ATGCGCCGAG CACATGCATA 4620

AAATCCCCTA ACAACTAAAA GTTGTAAGGG GATTTAAATT AATTTAGTGT ATCTTGGATA 4680

TCTTGTTTTG kTTGaTTAAT ATCTTCTGTT TTTTCTTCTT TTTTATCTTT TAATTTTTCT 4740

TCAACTTCTT TAGCTTTTTC TGCTGCTTTT TTATTTTGAT TTTCATTAGA CATGATTAAT 4800

TCCTCCCAAA TTGGATAATT ATTTATATAT AAATCTTACC CGGTTGTACT TTCGTTAAAC 4860

TTTTCTAAGT CTATAGCACT ATTTATTCAT TTATCTAAAG ACAACAACAT TAGATTAATA 4920

TATAATGATT TTGAGGTGAA CATAATGTCT TTTCTTAGGA AACACGCCGA AATTATTTTT 4980

AGCTATTTAA TCGGTmwCGT TcACTCTTCA CTGGkcTCAT TATTTTAATT AACTTGCCAT 5040

TAATTAAACA ATTAAATGGT GGTAAAAAAG TTGATACACA TGTTCATAAT GTGTGGGAAT 5100 

TTCTGAATGC ATTTTTCAGT GAAATTATTA AAGTAATGAG TCGATTTATA GGTAATTTCC 5160 

CtATAGTTAG TGCAATTGTG ATAATTATAT TCGGTATTTT AGTTATGTTG ATTGGTCATA 5220

CATTACTTAG AACTATTAAG TATGACTATG ATATTTCTAT CTTTTTCTTA GTTATCGGTA 5280 
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TTTTCATTAT TCCATTTACA ATTCATATAG GATATATCGT CTATAAAGAT GAATTGAATC 5400 

AGGAAAATGT AAAAAATCAT TTCATGTGGA TAATTGTGAG TTATGGTATA AGTTACTTAA 5460

TTACACAAAT TGCATTGTAT GGCAOAATTG ATGCTAATGA AATAGAGTCA ATTGATATCT 5520

TAAGTGTCAA TGCTTTCTTT ATAATTATGT GGTTACTTGG TCAAATGGCT ATTTGGAATT 5580

TCTTGTTCTT GCGCCGAGCT TTACCTTTAA CAAAGCAAGA ATTAGGTGAA GAGGAGCCAG 5640

AATTATCAAG AACAAGTAAA GGGAATGTCA CGAATCAAAC TAAAATTCAC TTGAAACAAC 5700

TCCAAGATAA GACTACAGAA TATGCACGTA AGACAAGAAG AAGTGTCGAT TTAGATAAAA 5760

TTAGAGCTAA AAGAGATAAA TTCAAAAAGA AAGTTAATGA TATTATCGAT ATTCAAGAAG 5820

ACGATATTCC TGATTGGATG AGAAAACCGA AATGGGTTAA ACCAATGTAT GTCGAACTAT 5880

TTTGTGGTGT CGTCATCTTT TTATTCACAT TTTTAGAATT TAATAATCGT AATGCATTAT 5940

TTGTATCTGG TGATTGGAAA TTATCACAGA CACAATATGT TATTGAATGG GTTACATTAT 6000

TAATTCTGTT ATTCATTATT ATCGCATATA TCGCTACAAC GTTAACTTTC CACTTGAAAG 6060

GTAAGTTTTA TTATTTACAA TTATTTATGG GGAGCATTTT ATTCTTTAAA TTGTTAACGG 6120

AATTTATAAA TATAATGATT CATGGACTAT TACTTTCAGT GTTCATTACG CCAACATTAC 6180

TATTAATGTT ATTGGCAATC ATCATTTCTT ATTCGTTACA ATTACGAGAG CGACCATAAT 6240

TAAAAGCATT ATAAAAGTAC TATCTATTAA ACATTTTGAT GTGTACGCTA TAAGTTAGAT 6300

ATATCTCTAA CTTACTTAGA TACAGGTCAA TGAAGTTTAT GGATAGTACT TTTTTTGTAA 6360

CTAGATTTGA TTGATTCAGG TGATGTGAAT TAAGTATTGA rAATTGTATA CAAAGTTTTA 6420

GTGCAAATAA AATAGTTGAA AAGTTATCCA TTTGTAAAAT CAAGAAAACT AGTAAATAGT 6480

TGAAGCGACT TATGGaATTT GCGAAACGAT ATATAGTATT TCCTTTGTAG AAATTTmACA 6540

TATATCATTC AAATTACTAA TTTGTTAAAA TCAACAGTAA GATTAGAAGT AGATGATATT 6600

GAAATTTGGC AAACAaTtTA ATCTATATAA AACTACAACG AAACACAGAA AGGAAGTTGT 6660

CAGATGAAAA TAGCAACTCT GAACAAAGGC AAAGAAACAA AATATTTTAA TGGATATCCT 6720

TTAATTGAAG AAGAGGATAT CTATTCACAA GATCATTTAA AAGAAGGAGA TATTTTTCAA 6780

ATTGTGACTG ATAAATCACA ATAT 6804

(2) INFORMATION FOR SEQ ID NO: 366:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1717 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366:

aaaAGAGACG CATTaAAACA AGCTATACAA ATTATCGATA AATTAACATG GGGTGTTTAG 60

TGGTGGTTAA AGAAATTTTG AGACTATTAT TCTTACfAOC GATGTATGAG CTAGGTAAGT 120

ATGTAACTGA GCAAGTATAT ATTATOATGA CGGCTAATOA TGATGTAGAG GCGCCAAGTG 180

ACTTTGAAAA AATCAGAGCT GAAGTTTCAT GGTAATAGCT ATTATCATTT TTGAATTAAT 240

TATATTAATG TGTGTAGCAA TAGCACTGGA GGTGTTGTAA ATATGTGGAT TGTCATTTCA 300

ATTGTTTTAT CTATATnTTT ATTGATCTTG TTAAGTAGCA TTTCTCATAA GATGAAAACC 360

ATAGAAGCAT TGGAGTATAT GAATGCTTAT CTTTTCAAGC AGTTAGTAAA AAATAATGGT 420

GTTGAAGGTT TAGAAGATTA TGAAAATGAA GTTGAACGAA TTAGAAAAAG ATTCAAAAGC 480

TAAAGAGAGG CGTTGGCTTC TCTGCTCTAT CyAAAATAAT GAAAGGAGCC saACATGTTA 540

GaCmAAGtCA CTCAAATAGA AACAATTAAA TATGATCGTG ATGTCTCATA TTCTTATGCT 600

GCTAGTCGTT TATCTACACA TTGGACTAAT CACAATATGG CTTGGTCTGA CTTTATGCAG 660

AAGCTAGCAC AAACAGTTAG AACTAAAGAA GATTTAACTG AGTACAATAA AATGTCTAAG 720

TCTGAACAAG CCGATATAAA AGATGTTGGC GGATTTGTCG GTGGATATTT AAAAGAAGGC 780

AAACGGCGTG CTGGTCAAGT CATGAATCGT TCAATGCTAA CACTTGATAT CGATTATGCA 840

GCCCAAGATA TGACTGACAT ATTATCTATG TTTTATGATT TTGCATATTG TTTATATTCA 900

ACACATAAGC ATAGAGAGAT AAGTCCAAGA CTGCGTTTAG TGATTCCTTT AAAACGAAAT 960

GTAAATGCAG ATGAGTATGA AGCTATTGGG CGTAAAGTCG CAGATATCGT TGGCATGGAT 1020

TACTTCGATG ATACAACTTA TCAACCACAT AGGTTAATGT ATTGGCCTTC AACTAGTAAC 1080

GATGCGGAAT TTTTCTTTAC CTATGAAGAT TTACCTTTGT TAGACCCAGA TAAAATATTA 1140

AATGAATATG TTGATTGGAC TGACACATTA GAATGGCCAA CGTCTTCAAG GGAAGAGAGT 1200

AAGACTAAAA GATTAGCAGA TAAGCAAGGC GACCCAGAAG AAAAGCCGGG AATTGTTGGT 1260

GCATTTTGTA GAGCCTATAC GATAGAAGAA GCTATAGAAA CTTTTATTCC TGATTTATAC 1320

GAAAAACATT CTACTAACCG TTATACCTAT CATGAAGGTT CAACTGCAGG TGGATTGGTG 1380

TTATACGAAA ATAACAAGTT TGCCTATTCT CATCATAATA CGGATCCCGT AAGCGGTATG 1440

CTTGTGAACA GTTTTGATTT AGTACGCATA CACTTATATG GTGCTCAAGA TGAAGAAACT 1500

AAAACAGATA CTCCGGTTAA TCGACTACCT AGTTATAAAG CAATGCAGCa AAGAGCGCAA 1560

AATGATGAGG TTGTTAAAAA GCAATTAATT AATGACAAAA TGTCTGATGC AATGCAGGaT 1620

TTCGATGAAn GAGAAAATAG CGATGATGCA TGGTCTGAGA CGTTnGAAAT TACTTCGAAA 1680
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(2) INFORMATION FOR SEQ ID NO: 367: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1847 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367:





AAGATTAGGC ATCAAGACAG GGTCGCGATT GTTTGAAATC CCACATAGnA ATGATATTTa 60

CaTTATCAAT CCaAGTATGC GTAAATATCT TAATGTTTCA GTTGCTATTT CTAAGATTGC 120

ATTGCGTTAT ATTCCACCTG AAGATTTACA CCAATATAGT ATTGACGAAT TTTTTATGGA 180

TGTTACTGAT AGCTATCATA GATTTAGTTC TACAGTACAT GCATTTTGCG AAAGACTTAA 240

ACGTGAAATT TATGAAGAAA CAGGCATTTA TTGTACTGTG GGCATTGGTT CTAATATGTT 300

ATTAAGTAAA ATTGCTATGG ATGTTGAAGC GAacATAGTc AAAATGGTAT AGCTGAATGG 360

CGATATCaAG ATGTACCAAC GAAATTATGG CCAATTCmGC CCtTGCGAGA TTTTTGGGGT 420

ATTAATCGTC GAACAGAAGC CAAATTGAAT AAAAGAGGAA TTTTTACTAT AGGAGATTTA 480

GCGAAATATC CATATAAATT TTTAAAAAAA GAGTTCGGTA TTTTAGGTGT TGATATGCAT 540

CTACATGCGA ATGGGATAGA TCAGAGTAAA GTACGTGAAA AGCACAAGAT CAGCAATCCA 600

TCGATATGCA AAAGTCAAAT ATTAATGAGA GATTATCATT TTGATGAAGC AAAAGTAGTA 660

ATGCAAGAGT TAATTGAAGA TGTTGCTAGC AGAGTTCGAG CAAGAAAAAA AGTGGCAAGA 720

ACGATACATT TTGCCTTTGG CTATAGTGAT GAAGGCGGTG TACATAAGCA ATATACTTTG 780

AAAGATCCAA CAAACTTAGA AAAAGATATT TATAAAGTAG TAATGCATTT CGCAGATAAA 840

TTATGTAATA AACAAGCACT ATATCGTACG CTAAGTATAT CTTTGAGTCA ATTTATTAAT 900

GAGGATGAGC GACgTTAAGT CTGTTTGAAG ATGAATACCA ACGCAAACGT GACGAATGTC 960

TAGCTAAAAC GATAGACCAA TTACATTTGA AATACGGCAA AGGTATTGTG TCCAAAGCAG 1020

TATCGTTTAC AGAAGCAGGT ACAAAACACG GCAGATTAGG TTTAATGGCT GGACATAAAA 1080

TGTAATGACT ATACGGTTTA AGTAATATAT AACTGTGATT CGTATAAAAT AAGTCTCTAA 1140

AGATAAATAT TTCATATATC ACAATAGATT TTCACAATAA TATCTAAGAA TACATGGAAT 1200

TTATCAAAAG AGACTTAATA ATTATTGGAT ATAACAATCA AAATCACTCA ATGCTTGCAT 1260

ACCGCGTTCT CGGTCAGTAG GGTTTTTGAA ACTAATTTTT AAAGCACCGT ATATATCTTC 1320

GCGTACTTCT AAGATTCTTA AGTTGCTTAT AGATATGTTA TGTAAACTCA GGATATAAGT 1380
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TAOTCCACCT AGTTGTTTAO CGGGTAGTGC GTCGCGATAC GATTTAGCTT GGGCAAAAAA 1500 

TGATAACAAT TTTTCAGAAT CATTGCTTTC AATTAGTCTt TCTAAATCTT GAAACTGACT 1560 

TTTTAGCTGT CGAATCATTT CTAAAATATA CGTTTTATTA CTCAAGGTGA TATCTTTCCA 1620 

CATTTGTGCA TTACTACTAG CTATACGAGT GATATCACGA AAACCACCAG CTGCAAGTTT 1680 

ATTAACTAAA TGATGTTCTT GACCGTTCTT TTGACTAACA TGAACTAAAC TAGATGCAAC 1740

GATATGAGGT AAATGACTTA CGACGCTTGT TACGTAGTCG TGTTCTTCAG CAGTAGTTAC 1800 

AATAAATTTA GCAAGAGTAG GTGATAACAG TTCTTTTAAC GTGTTTG 1847 
(2) INFORMATION FOR SEQ ID NO: 368:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 494 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368:

AACAAAAGGC ATAAGTTCGT GAATTAATGC GTATACAAGG ATAAAGCTTA TAACAGTAGT 60 

AATTGTTGCT ATCAAACGAA CAACATATAT TCTATTTTCA GATAGCAATC GATTCACTAA 120 

TCGATAATTT ACGTATACAA GAATTAACAA CAGCACAATA TAAACAATAA TCATATTTGG 180 

CCCCATTTAA TTTTTAAATT TGTTTTCACA TCATTTTCTA CTTTATTATC ATACTAATTT 240

TAAAGGCAAA GGTGGACATC GGCAACCTCT CGTAAACTAT TTATCAAAAA TAAACGTATC 300

TCATTGTTAT GATATTTATA AATCAATTCG TTTTTATTAA AGTCTTTTTC AACAAGTTTG 360

TCACTATCTA TTAAATAATC ACGCATGCAT CCTTTTAAGA AATCATCTTT ATAACTTGGT 420

GTGTACCATT TTCCATCCTC TTCAATGACA ATGTTGCCAA TATCAAATTC AAGGACCTTG 480

CCGTCCTCTG AAGT 494

(2) INFORMATION FOR SEQ ID NO: 369: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2518 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369:

GTCCATATTT CCCGTCCAAC CAACTAAAAA TGGGTAAATC CCTGGATTTa AATCTACTTC 60
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

GATAATTTCn AATGTAGCGA CCATCATTAC GAGATGATTT GATATAAGCA CAGTTTGGAT 180 

GTTGACCAAT ACTATCGCCT TCTTCTTCGA TGATATCTAT TTTAATACCA TCATCAGCTG 240 

CAATTTCTAA TGAAOATTTA ATTCGGTTAT CAAATGTTGA ATATCCCATT GCTCCACCCA 300 

CAATAGCGAC ATCTGTACCA TGTCCTTGGT GTGTTTGAGC AAATGATTCA TAATAATGTA 360 

TTTCAATATT TTTAGGAATA TCTCCCAATA TTGCGCGTGC TGAATTCCCA ATCTTTACTG 420 

CACCAGCCGT ATGAGAACTT GAAGGGCCCA TCATAACTGG TCCGATAATA TCGAAAGCAC 480 

TTTGATAATC ATAGCTCTTT GCCATAATTA AACACTCTCC TTAATATGAT TCTTTTTGCT 540 

CGGCATTTTA AAGTTGATAT TCATTAAATT AAACTTATTA ATTAGTGTTC CAATAATATA 600 

GGCTAAGACA ATGCTGACAA TAATCACTGT TGCGATTGTT ACAATTGATG TCACTGCATT 660 

ATTAAAGCCA AACAATACGA TGGCGCCTGC AATTGGTGTT GCCATACCTT TGACACCTAT 720 

TACTAGTCCG CTAAATGTCA CGATACATGC GTTGACGACC CCAATCAOTG CATTTGTACC 780 

ATATAGTTGT ACTGGATATT GCGCTATTAA ATCAATTTGC GTCAATGGCT CAATACAAAC 840 

TGCAAATGCT TTTGACGGTC CACCAATGTT TAATTTTCGG AATAAAATAA GGTTAACAAA 900 

TGAGCTACCT GTACATGTTA GTGCTCCAAT AGCCATAGGA ACACCTGTCA GTCCTAATAA 960

ACTTGTTAAT ACCATTGAAC TTAGGGGTGT CATACCTGTA ACAGGAATCA CTAGTCCTAA 1020 

AATGACCGCT AATGCATATG GATTGTTATC ACCTACCGCA GTGACAGCAC TACCTATTTG 1080 

TTTTAATGTT GCTAGCACAC CAGGTGTAAT GATTGATGCA AGTCCGAAAG CAATTGCTGG 1140 

TGCAAATAAG ATCACCACAA TTAAGTCCAA GCCTTCTGGA ACTTTCTTTT CAATCCATTT 1200 

AATTAAAAAA GcTACGCCAT AAGCTGCGAT GAATGCTGGT AATAATTTAA AGTCATGTAA 1260 

TACTAAACCA ACAATGACCG CAAATACTGG TGCAACGCCT AAGTTTAAGC ACGTTAGAAT 1320 

ACCTACTGcG ATACOGCTTA AACTTCCTGC TAAATCCCCA ATATCTTGTA GAAATTTAAT 1380

ATCAAATACG CCACCAATAG CATAACTTAA GAATGCTTGT GGTAGAAATG TCGCACAAGC 1440

TGCACCGGAT AATGCTTGTA GTCCTTGTTT ACCGTAOGGT GCATACTTTA AAAATAGCGT 1500

CATGATCACT AAAACCAAGA CTAATGTGCC TACACCTAAC AGAATATCCA TTTCCCAAAA 1560

ACCTCTCTCT ATGTTTATTT TATTTTCAGA CCATAAACAT CGTACACCCT TAAGAAAACG 1620

TTTTCAACTT TTATCTGTTA TCAAATCaAA TATTTAAGTG AAATATTTCA TATTTGTGAA 1680

AGATTTTTAA AATGGATTGT TTCAAAAACT ACTTATGTTG TCGTTAATAT TTACTAATTA 1740

ACTTTTTACT CTATATTTCA AACAGTTGTG TGACAGTTTT TTGATAACTT TTTTACATCT 1800

GAAAGTAAGT AATATTTCTA AAAACTTTTA ATATTTATAC ACTTTATCTT TCGAGCTATT 1860
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

ACACATATAT TTGCAATAAG ATAATTAAAO TAGGATATTA TTTTTAGTTT TCTGATAGGA 1980

ATGATGATAG TCTATAGGTT GAATCTTTAC TTTTTTTTAA AGCTAAATTT ACATCAACTT 2040

AACAATGGTT GGTTATACTG AAGATGAAAA TATTTAOTAT AACTTAGTGG AGGCGATAAA 2100

GGTGCAATTG AGTCATTCCG TTAAAGTTGC AATTTCTATC TATTTAGCAC TTATCTTTAT 2160

AACGTTCACT TCTTATTTAG TCATTATTTT ATATACGAGT ATGACTGGAC ATGATGTATC 2220

ACATTTCGTG TTAGATAGTC AGCATTCTCA TCATGGATCT CTTACGCAAA AACATTTGAG 2280

TCTTCCTGAA ATCTCATTTA AATAGTTTAT CCTCTGTGTT TCAACATTCA TTTCCCATAT 2340

CGATTCATTT ATCTATCATC TAGACCACTA CATCTTAGAT GATTTTTTTA TTTTCTCATT 2400

TCACTCTTTC TTAAAAGTCG ATATAATGAA TTAAATCATT ATCATACACC GACATATTTT 2460

ATGTTGTTGG TGTTAAGTTT aAAGGGGTGA GATACTTGGC GAATaATCaT TCAGCTTT 2518



(2) INFORMATION FOR SEQ ID NO: 370:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 790 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370:

ATACTACTGG ACCGTCnACC AATTGACAAA ACCTGAAAAC TGGATTTTTA TTGCAGGATT 60

ACATCGAACC TGGAAGAACA ATCCCCAGGA GTTGATGATA TTCGAGATAT GGCATACAAT 120

CAAGGTAGTT TAGATAAGAC AATTTATGAA ATTTCTAAAC GCACAGTACT ATTTTTAATA 180

CAGAAAGATA TTACGGTATA TAATAAGACG ATTGACTGTT TAAATTATTA TAACTATAGT 240

GACGAAAGAA TAAAGGATGA TTAAATGAAT TCACAAGAAT TATTAGCAAT TGCTGTGGAT 300

GCAATTGACA ATACCCCAGG CGAAGATACG ATTTCTTTAG AAATGAAAGG TATCAGCGAT 360

ATGACAGATT ATTTTGTTGT AACGCACGGA AATAATGAAC GACAAGTTCA AGCGATTGCT 420

AGAGCGGTGA AAGAAGTAGC CAATGAACAA AATATAGAAG TAAAACGTAT GGAAGGATAC 480

AATGAAGCGC GTTGGATATT AATTGACTTA GCTGATGTTG TGGTACATGT TTTCCATAAA 540

GACGAAAGAA ATTATTATAA TATTGAAAAG TTATATCAAG ATGCACCATT AGAATCATAT 600

AGTCAGGTTG CGTATTAATT ATGTCGCAAT ATGCAGAAAT GAGCCTAGTG TACGATCAAT 660

TGACTCAAGA TCAACCATAT GAAAAATGGT TTGAAATTGT AAAAAATCAC TGCAAAGATG 720

AATCAAATAT TTTAGATATT GGATGCGGTA CTGGTAGTTa ACAGTTCAAT TAGAAGCTTT 780 
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(2) INFORMATION FOR SEQ ID NO: 371: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1823 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371:

ATAGATGAAG GTGCAAATAT TGAAaTAGGT TATTTACCTG GACGCTTGAA ATGGTTAGTT 60

GCTGATTTAT TAACTAAACA AGGATTAAAA GTAGTTAACG ACGATATGAC AGGAAGAACG 120

TTAAAAGATC GTAAATTATT AACAGGTGAC AGTCCTTTAG CTTCAAATGA GTTAGGAAAA 180

TTAGCAGTTA ATGAAATGTT AAATGCAATA CAAAATAAAT AATTAAATAT TAATTAGAGG 240

AGCCTCATAT GTAAATGTAT GAGGGCTCTT TTTTTTGGCA AAATTTAAGT GATACTTGTA 300

AAATAGAACC TATTATGAGT ATGATTTAAG AAAACGCTTG CAAAACTAAT AACCGCAACT 360

AGCGATATGG AGGAAACATG ATGTCTTATA GCATTGGAAT TGATTATGGA ACTGCTTCAG 420

GCCGTGTGTT TTTAATTAAT ACAACTAACG GTCAAGTAGT ATCAAAATTT GTGAAACCAT 480

ATACACATGG TGTCATTGAG AGTGAATTAA ATGGTTTGAA AATACCACAT ACATATGCAC 540

TTCAAAATAG TAATGATTAT tTAGAAATTA TGGAAGAAGG AATATCATAT ATAGTACGTG 600

AATCAAAAAT AGATCCAGAC AATATAGTAG GTATTGGTAT AGACTTTACT TCATCTACTA 660

TTATTTTTAC TGACGAAAAC CTTAACCCGG TACATAACTT AAAACAATTT AAAAACAATC 720 

CACATGCGTA TGTGAAACTT TGGAAACATC ATGGTGCATA TAAAGAGGCA GAGAAATTAT 780 

ATCAAACTGC TATTGAAAAT AATAATAAGT GGTTAGGCCA TTATGGATAT AATGTTAGTA 840 

GTGAATGGAT GATTCCCAAA ATAATGGAGG TCATGAATCG AGCACCAGAA ATTATGGAAA 900 

AAACGGCTTA TATTATGGAA GCGGGCGATT GGATTGTAAA TAAATTAACT AATAAAAATG 960 

TACGCTCGAA TTGTGGATTA GGTTTCAAAG CATTTTGGGA AGAAGAAACA GGGTTTCATT 1020 

ATGATTTATT TGATAAAATA GACCCCAAAT TATCAAAAGT AATTCAAGAT AAAGTATCTG 1080 

CACCGGTTGT TAATATTGGT GAAGCAGTAG GGAAACTGGA TGATAAAATG GCACAGAAAT 1140

TAGGATTATC AAAAGAAACT ATGGTAAGTC CTTTTATTAT TGATGCCCAT GCTAGTTTAT 1200 

TAGGTATTGG GTCTGAAAAA GATAAAGAAA TGACTATGGT GATGGGAACA AGCACATGCC 1260 

ATCTTATGTT AAATGAAAAG CAACATCAAG TGCCAGGTAT ATCAGGTTCT GTAAAAGGAG 1320

CAATTATTCC AGAATTATTT GCTTATGAAG CGGGGCAATC AGCAGTAGGT GATTTGTTTG 1380
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CTGTATTTGA ATTAATGAAT GAAAAGATAA AACATCAAAT GCCAGOTOAA AGTGGGCTCA 15 00 

TTGCTCTTOA TTGGCATAAT GGAAATCGAA GTGTATTAAG TGATAGCAAT TTAACAGGTT 1560
GTATCTTTGG ATTAACTTTA CAAACTAAGC ATGAGGaTAT TTATAGAGCm TATTTaGaAG 1620

CTACaGCATT TGGTACtAAG ATGATTATGC mACAGTATCA AGATTGGCAT ATGGaAGTAG 1680 
aAAAGGtATT TGCaTGTGGc gGTAtACCtA AAAAGAATGC TGTTATGATG GATATCTATG 1740 


CGAATGTACT GAATAAAAAA CTAATTGTTA TGGATAGTGA GTATGCACCA GCAATAGGCG 1800 
CAGCAATATT AGGTGCAGTC AGT 1823 
(2) INFORMATION FOR SEQ ID NO: 372: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1600 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372:

ACGATCATCT GCATCAGCGA ATTCCGATGc AaTTcATATG tCCtAATAAC AAAAGAATAG 60

GTTTTaAAAG ATATGCATAC CGTAATGaTA GATATAGTTT TAAACGTGAC TTCAAGCTAT 120

ATGaATGTGA TGACTGTTCA TCATGTTCTT TGAGACATCA ATGCATGAAG CCAAATTCGA 180

AATCCAATAA GAAAATTATG AAGAATTATA ATTGGGAATA CTTTAAAGCC CAAATTAATC 240

AAAAGCTTTC TGAACCAGAA ACGAAAAAAT CTATAGTCAA AGAAAAATTG ATGTAGAGCC 300

TGTTTTTGGA TTTATGAAGG CTATTTTGGG TtTCACTCGA ATGTCAGTTC GGGAATAAAT 360

AAAGTTAAAC GAGAGCTAGG TTTTGTATTA ATGGCACTTA ATATAAGGAA AATAgcaGCT 420

CAACGAGCTG TACATTATAA AATACATATC AAAAAAGCTG ATTTCTATCA AATAAATAAT 480

AGAAATCAGC TTTTTTACAT TGCCTAAGAA CTTTAAGGAA CTTAATGTCC CAAGCTCTTT 540

TTTGTTATAT CTAATTCGTA ATTTATGATT GTTTATTCGG TCCTTTGATG TTCACTAAAT 600

GTGACTTTAA ATCTTGTTCT AATTGTTGTA ATTCTTTTTC AGCTAATTGT CGTTCTTCGC 660

GACCGTGTTG TTGAATAATT AATGTTTCTT CAATTGTCTC AATAATGTTA CGCTGTGTAC 720

GTTTCAATGT ATCAAGATCA ACAATGCCAC GCTCATTTTC TGTTGCAGTT TCAATCGCAT 780

TTTGTTTCAA CA1TTCAGCA TTTGCTGTTA ATAAATCATT AGTTGTATCA GTGACAGCTC 840

GTTGTGCAGC AACTGCATTA CGCTGTCTCA TTAATGTAAG CGCAATGGCC ATTTGATTTT 900

TCCATAGTGG AATACTTGTC AAAATTGAAC TTTGTATCTT CTCGGCAAGT GCTTGATTAA 960
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AGTCATATAT GCGTTTATCT AGTCTATCTA TAAATTGCTG CATATCTGCA ACTTGTTGAA 1080 

TATCCATTTG ATTAGTGGAt TgtGCGCTTG CTGTTGCAAT TGTGGTAGCT TTTCATTTTC 1140 


TAATTGCAAC TTTTTTTGcT QTGCAGCAAT GATATGCAAT GATAAGTCAT CAAAGTATTG 1200 

TTTGTTTTTA TCATATAGCG TATCTAATAA TTCAATATCT CTTGTTAAAT GTGTTTGATG 1260 

TTTCTGCAGT TGAATCGTTA TGCGATCGAC TTGAGCACTA ACTGATTGCA TTCTTGAAAA 1320 


GATTTCATTG ATAGACGACT TTGCTCTGCT AAAAATTCTT TTTAACATAG ATGGTTTATC 1380 

AGTATTTAAC TCATTTGGAT TAACTGACTT TAGTTTTGAC ATTAGATCTG ACAAAGTATC 1440 

TCCAATAGGA CCAACATCTT TACTTTGTAC TTCATCCAAC ATTTGATGTG AAAATTGAGA 1500

CATTTGTTTC TGkAAATCAG AACCAAACGC TAATAAACCT TCATTGTCTA AAGGGTTAAT 1560 

TTGTTTACTG ATTGTGTCTA CCTGTTTTTG TTGTTCAATT 1600 

(2) INFORMATION FOR SEQ ID NO: 373:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1227 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373:

ATGTTGATAA TGGTAGCTTC TACAAGAATA AAGACCAACA AGTTGGTGCA ACAATTCTTG 60

aTAGTAAAAC TGGTGGTTTA GTTGCTATAT CTGGTGGACG TGATTTCAAA GACGTCGTTA 120

ACAGAAACCA AGCAACAGAT CCTCACCCTA CTGGTTCATC TTTAAAACCT TTCTTAGCGT 180

ATGGACCTGC CATTGAAAAT ATGAAATGGG CAACAAACCA TGCGATTCAA GATGAATCTT 240

CATATCAAGT TGATGGTTCT ACATTTAGAA ACTATGATAC GAAGAGTCAC GGTACTGTAT 300

CTATTTATGA TGCTTTACGA CAAAGTTTCA ATATCCCAGC TTTAAAAGCT TGGCAATCAG 360

TTAAGCAAAA TGCTGGTAAT GATGCACCTA AGAAATTCGC TGCCAAACTT GGCTTAAACT 420

ACGAAGGCGA TATTGGTCCA TCTGAAGTAC TTGGTGGTTC TGCTTCAGAA TTCTCACCAA 480

CACAATTAGC ATCAGCATTT GCTGCAATCG CTAACGGTGG TACTTATAAC AACGCGCATT 540

CAATTCAAAA AGTAGTTACT CGTGATGGTG AAACAATCGA ATACGATCAT ACTAGCCATA 600

AAGCGATGAG TGATTACACT GCATACATGT TAGCTGAGAT GCTAAAAGGT ACATTTAAAC 660

CATATGGTTC TGCATATGGC CATGGTGTAT CTGGAGTAAA TATGGGTGCT AAGACAGGTA 720

CTGGTACTTA CGGTGCTGAA ACTTATTCAC AATATAATTT ACCTGATAAT GCAGCGAAAG 780
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AAGTTAAACA ATATGGTGAA AACTCATTTG TrGGACATAG CCAACAAGAA 


TATCCACAGT 


900 




TCTTATATGA AAATGTGATG TCAAAAATTT CATCTAGAGA TGGCGAAGAC TTTAAACGTC 960

CTAGCTCAGT AAGTGGTAGT ATCCCATCAA TCAATGTTTC TGGTAGTCAA GATAACAACA 1020

CTACAAATCG TAGTACACAC GGTGGTAGTG ACACATCAGC AAACAGCAGT GGTACTGCAC 1080

AATCAAATAA CAATACTAGA TCTCAACAAT CTAGAAACAG CGGTGGATTA ACAGGTATAT 1140

TCAACTAATC CACTCAACAT AAAATCCTCA GTTATACCAT ATTTATGGTG TAGCCGAGGA 1200

TTTTnTTAGG TTCTTCATCT TTTATGG 1227

(2) INFORMATION FOR SEQ ID NO: 374:






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1953 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 






CCATATGGtG CAACATTCTT CGTATTTAGT GATTATTTAA AACCAGCGTT ACGTTTATCA 60

TCAATTATGG GATTAAATgC aACGTTCATC TTCACACATG AyTCaATTGC AGTAGGTGAA 120

GATGGTCCTA CTCATGAACC AATTGAGCAA TTAGCTGGAT TAAGAGCCAT TCCAAATATG 180

AATGTTATCC GTCCTGCTGA TGGTAATGAA ACAAGAGTAG CATGGGAAGT TGCCTTAGAA 240

TCTGAATCTA CACCTACTTC ATTAGTATTG ACACGTCAAA ACTTACCGGT ATTAGATGTA 300

CCAGAAGATG TAGTTGAAGA AGGCGTTCGA AAAGGTGCCT ATACAGTTTA TGGCTCTGAA 360

GAGACACCAG AATTCCTATT ATTAGCTTCA GGTTCAGAAG TTAGTCTTGC AGTTGAAGCT 420

GCTAAAGATC TTGAAAAACA AGGTAAATCA GTACGTGTTG TTTCAATGCC TAACTGGAAT 480

GCATTTGAAC AACAATCTGA AGAATATAAA GAATCAGTTA TTCCATCAAG CGTAACAAAA 540

CGTGTTGCGA TTGAAATGGC TTCACCGCTT GGATGGCATA AATATGTAGG TACTGCAGGT 600

AAAGTTATTG CTATTGACGG CTTTGGCGCA AGTGCACCTG GCGATTTAGT AGTTGAAAAA 660

TATGGATTTA CAAAAGAAAA TATCTTAAAC CAAGTTATGA GCTTATAAGA ATAATTTATA 720

AAGCGAGTAT GTTTAGAAGT CTAGGATGCA TAATCTTAGG CTTCTTTTAA AGTGTTGAAA 780

TTTAGAGTAT AGCACTTAAA CTACATCATA AGTGATAAGT TATGAAAGTA TACTATTTCA 840

GATTAATCTT TAAAAGCTCT GTTATAACAG CATGATTTTT GATATTATTT TTAGTATCGA 900

TATTAAAATA CTTGAATAAA CTAGTTCTTG AAATAATGTG ATGAATTTAG TAAAATTCAG 960
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10 






TGCATTAATT TTAGGTTTAA TTGGAGGTTT CCTTTTAGCT AOAAAATATA TGATGGACTA 1080 

CTTGAAGAAA AACCCACCAA TCAACGAAGA AATGCTTCGT ATGATGATGA TGCAAATGGG 1140 

TCAAAAACCT TCTCAGAAGA AAATTAATCA AATGATOACO ATOATGAATA AAAATATGGA 1200 

TCAAAATATG AAGAGTGCGA AAAAGTAAAT TCGCAATTGA TAGAGGCTAT TTTCCAGATA 1260 

TGGAAATGGC CTCTTTTTAT AATCAAATTA ATAAGAATAA ATATGTTTAT TAAAATTAAA 1320 

GTTAACAAAA TGACGAATAG ACTGAGAAAT GCTATAATTC ATTTTGTATG ATTTACAGAG 1380 

AGTTTATTTA ACGAGAAGGT GTCyGCGTGC TCTATTTAAT ATTTTCAATC ATTGTAGCTT 1440 

TATTTATGGG AACTATAGTT ATAGTTATTC GTATGAAAGC TCAAAATTAT CCGGTAAAkG 1500 

AGAAAAAAAT AGTTTTGCCa CCgTTTTTTA tGGCgACCGG TGCATTGATG TACGTCGTTC 1560 

CaTATTTTAG GCTAACAGGA TCGGAAATGC TAGAAGCCTT TATAATTGGT TTGCTTTTTT 1620 

CtACAGTTCT AATTTGGACT TCTCGATTTG AAGTCAAAGG TACAGAAATT TATATGAAAC 1680

GATCTAAAGC ATTTCCAGTT ATTTTGATTT CATTACTTAT CATTCGTACT GTGATGAAAA 1740 

TATTCATTAG TAATGAAATA GATCCTGGAG AATTAGGCGG CATGTTCTTT TTATTAGCAT 1800 

TCTGTATGAT TGTTCCTTGG AGAGCAGCAA TGCTATATAA ATACAAAAAA CTAAAGAAAA 1860

CATTAATCAA TTAATTACTT TTaAAACCAC TTGTGATOGA CTTCTAAATC AGTCAATGAG 1920 

TGGGTTTAAT nTTACTTGGA AAAGGnGGAA AGG 1953 
(2) INFORMATION FOR SEQ ID NO: 375: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3787 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 



ACATTTGATC AAnTTATCGA CATTAAAGAT GAATTCAnTT GATCGTnTCA ATGATTATCC 60

TGTTGAAGTA GCACGTTTGC TTGATATAGT GGAmATaAAA GTACACGCAT TACATTCAGG 120

TATCcACGTT GATTAAAGAT AAAGGGAAAA TAATTGATAT TCATTTATCT GTAAAAGCCA 180

CTGAAAATAT TGATGGCGAA GTGCTGTTCA AAGCAACACA ACCTTTAGGT AGAACAATGA 240

AGGTTGGTGT TCAAAATAAT GCAATGrCAA TTACTTTAAC GAAACAAAAT CAATGGCTTG 300

ATAGTTTGAA GTTTTTAGTT AAGTGCATTG AAGAAAGTAT GAGAATCAGT GATGAAGCAT 360

AAAGAAGCAT TTAATGGCGT TGTCGTGTTA ACTGCTGCAT TAATTGTCAT TAAAATTCTG 420
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CAACAAGTGT ATCCAATTGT AGCATTAGGA 
ATGATATTAT CGATGAATGC CATTCCTAGT 540

GCAATTACAC AAAATATAGG GAAGTATCAT AGTGACGAAG CATATGCAAA AGCAGTCGCT 600

TATATACAAT TAGTTGGTAT ATTATTATTT ATTGCTATTT TTGTGTTTGC GAACAATATT 660

GCACATATGA TGGGTGATGG CCATTTAACA CCAATGATTC AAGCTGCAAG TTTAAGCTTT 720

ATATTTATAG GTATGCTTGG CGTGTTAAGA GGTTATTATC AATCTGCAAA TAATATGACA 780

GTTCCGGCTA TTTCCCAGGT TATAGAACAA GTTATACGAG TAGGTATTAT CATTGTTACT 840

ATTGTTATTT TTGTAGACAG AGGTTGGACG ATATATGAAG CGGGAACAAT TGCTATTTTA 900

GCATCAACGA TAGGTTTTTT AGGTTCTTCA ATTTATTTAG TAGGGCACCG ACCTTTTAAG 960

TTTAAAATGG TAAATAACAC TGCAAAGATC GTTTGGAAAC AGTTCGCACT TTCGGTTTTG 1020

ATTTTCGCTA TCAGTCAATT AATCGTAATT TTATGGCAAG TGATTGATAG TGTTACTATT 1080

ATTAAGTCAC TTCAAGCGAT ACGCGTGCCA TTCGATGTTG CCATAACTGA AAAAGGAGTC 1140

TATGACCGTG GTGCATCATT TATTCAGATG GGATTGATTG TAACTACAAC ATTTAGTTTT 1200

GCGCTCATTC CTCTGTTAAG TGAOGCAATC AAAATGAATA ATCAGGTACT TATGAATCGT 1260

TATGCAAATG CGTCATTAAA GATTACGATT TTAATAAGTA CAGCAGCGGG AATAGGATTA 1320

ATTAATTTAT TGCCTTTAAT GAACGGTGTG TTTTTTAAGA CGAATGATTT AACCTTAACG 1380

TTAAGTGTTT ATATGATTAC GGTCATTTGT GTATCGTTAA TTATGATGGA TATGGCATTA 1440

TTACAAGCGC AACATGCTGT GAGACCTATT TTTGTTGGTA TGACGGCAGG ATTGGTTATT 1500

AAATTTATAC TTAATATGAT TTTGATTCGT TTAAGTGGCA TTATTGGTGC GAGCATTAGT 1560

ACTGTTGTAT CATTAATTAT ATTCGGTACG ATTATCCATA TTGCTGTCAC GAGAAAATAC 1620

CACTTATATG CGATGAGACG ATTTTTTATC AATGTTGTTT TAGGTATGGT ATTTATGTCG 1680

ATTGTTGTTC AATGCGTGTT AAACATAGTG ACAACACACG GTAGAATCAC TGGACTCATT 1740

GAATTATTAT GTGCAGCAGT ATTAGGTATC ATTGCATTGT TTTTCTATAT TTTTAGATTT 1800

AATGTTTTGA CATATAAAGA GTTAACTTAT TTACCATTTG GTTCAAAGTT GTATCAAATT 1860

AAGAAAGGAA GACGTTGATG GCACATACCA TTACGATTGT TGGCTTAGGA AACTATGGCA 1920

TTGATGATTT GCCGCTAGGG ATATATAAAT TTTTAAAGAC ACAAGATAAA GTTTATGCAA 1980

GAACGTTAGA TCATCCAGTT ATAGAATCAT TGCAAGATGA ATTAACATTT CAGAGTTTTG 2040

ACCATGTTTA TGAAGCACAT AACCAATTTG AAGATGTCTA TATTGATATT GTGGCGCAAT 2100

TGGTTGAAGC TGCTAATGAA AAAGATATTG TCTATGCGGT TCCGGGTCAT CCTAGAGTTG 2160

CTGAGACAAC TACAGTGAAA TTACTGGCTT TAGCAAAGGA CAATACTGAT ATAGATGTGA 2220
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10 






ATGATGGCTT 






WVi lACAAIQA 


AGTAACACTT 


AATGTTAGAA 


2340 


CGCATACATT 


G ATT APT! PA A 




CAATGGTTGC 


TGCTAATTTG 


AAAATCACTT 


2400 


TAATGGAACG 


ATATCCTGAT 


fiATTACPPTYT 


TTCAAATTGT 


CACTGGTGCA 


CGAAGCGATG 


2460 


GTGCGQATAA 




^^^^^^^^^^ x^p^n & 


ATGAATTGGA 


TCATGATGAA AATGCATTCA 


2520


w\ X r\n XXX \MW— 




uinwUuvinA 


TCATAACATC 


GACATATTTG 


TATCATGACT 


2580


A IWil J. J. X \J\_ 




ATTGATACTT 


TAGTTGATGA AGATAAAGGT 


X VJ X UWtiVTW 


2640 




AAUbua XvjttlA, 


m — —/una m -m 

AC g CT AAAGC 


GTTATTTACT 


TGAAGAAACA 


X 1 X \3>iA X Xvr x 


2700 


lUiAAbLlAI 


TUACAATGAA 


GATGATTGGC 


ATATGATTGA 


AGAACTAGGA 


GATATTTTAT 


2760 


TAJCAAGTGTT 


ATTGCATACT 


AGTATTGGTA 


AAAAAGAAGG 


GTATATCGAC 


Al X AAAGAAG 


2820 


TGAlx ACAAG 


TCTTAATGCT 


AAAATGATTC 


GTAGACACCC 


ACACATATTT 


GGTGATGCCA 


2880 


ATGCTGAAAC 


TATCGATGAC 


TTAAAAGAAA 


TTTGGTCTAA 


GGCGAAAGAT 


GCTGAAGGTA 


2940 


AACAGCCAAG 


AGTTAAATTT 


GAAAAAGTAT 


TTGCAGAGCA 


TTTTTTAAAT 


X xATATGAGA 


3000 


AGACGAAGGA 


TAAGTCATTT 


GATGAGGCCG 


CGTTAAAGCA 


GTGGCTAGAA 


ft ft ft m ,« ft 

AAAGGGGAGA 


3060 


GTAATACATG 


AGATTAGATA 


AATATTTAAA 


AGTATCACGG 


TTAATAAAGC 


GACGTACGCT 


3120 


AGCAAAAGAA 


GTAAGTGATC 


AAGGTAGAAT 


TACAATAAAT 


GGTAATGTTG 


i milt m (kMMVIA 

CTAAAGCTGG 


3180 


ATOGGATGTT 


AAAGTTGAAG 


ATGTGCTGAC 


GATTCGCTTT 


GGTCAAAAAT 


xAviTAACAGT 


3240 


TAAAGTAACT 


GCATTAAATG 


AACATGCATC 


TAAAGATAAC GCGAAGGGTA 


*TV"T , ft. , ¥Y*' TV TV TAT* 

Xvj J. A X VjAAA I 


3300


CATTGAAGAG 


CGTCGACTTG 


AAGAAGCGTA 


AATTGGAGGT 


G ACAAG CAAT 


f2A AAA ATA 71 A 


3360 


GTAGAACATA 


TAGAAAATCA 


GTACACGTCG 


CAAGAGAACA 


AGAAAAAACA 


ACGTCAAAAA 


3420 


ATGAAAATGC 


GTGTTGTTCg 


TAGGCGTATT 


ACAGTATTTG 


CGGGCGTATT 


aCTTGCGATA


3480 


ATTGTTGTTT 


TATCaATCTT 


GCTTGTTGTC 


CAAAAACATC 


GCAATGATAT 


TGATGCACAG 


3540 


VjAvyvUAAAAG 


CGAAAGAAGC 


ACAGTTTCAA 


AAGCAACAAA 


ATGAAGAAAT 


TGCGTTAAAA 


3600 


GAAAAGTTGA 


ATAATCTGAA 


TGACAAAGAT 


TACATTGAAA 


AAATTGCGCG 


TGATGATTAT 


3660 


TACTTAAGCA 


ACAAAGGTGA 


AGTGATTTTT 


AGGTTGCCAG 


AAGACAAAGA 


TTCGTCTAGC 


3720 


TCAAAATCTT 


CGAAAAAATA 


AATCCAAATT 


GATTCAAAAT 


TATCCGAGTA 


TAGACATTGT 


3780 


GAAAAAA 












3787 


(2) INFORMATION FOR SEQ ID NO: 376: 









(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1644 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 

TAAACCATTT CAAACTGAGG AACGCnAAOA CGGsACgTTT CCAGATTTAG AAGTATTTAA 60

AAATGAATGT GATTTAAGCT ATGACATAAC GTCACTTTAT ACTTTTAAGC AACCTGTATC 120

ACCACACCTT GCATTTAAAA TGACAGATCA AATTTTTCTA AATAAGCAGC GTGTATTAGA 180

TAAGGTAAAA GTTTTAGATA AGGAATTTGA TTTTATCTTA ATTGAGGGTG CTGGGGGAAT 240

TGCCGTACCA ATATATGAAG GTACAGATGA TTTCTACATG ACTAAAGATC TAATCAATGA 300

TTGTGCAGAT TGTGTCATCA GTGTGTTGCC ATCAAAATTA GGTGCTATTA GCGATGCCAT 360

TGTTCACCAA GATTATGTTA ATCAGAATGT ATCGGCGAGT AATTTTTTAA TAATGAATCG 420

CTATACAGAC AGCTATATTG AAAAAGACAA TCAAATGACG ATTGGAAAAT TAACAAATAA 480

AACAGTCTAT ACATTTGAAG AACATGCCAC GTATGAAAAT TTCTCAGAAG CATTTTTAAA 540

ACAATTAATA GGAGTTAAAA ATGAATTACA CACAACAACT TAAACAAAAA GACTCAGAAT 600

ATGTTTGGCA TCCATTTACA CAAATGGGTG TATATAGCAA AGAAGAAGCA ATCATCATTG 660

AAAAAGGAAA GGGTAGTTAC CTTTACGATA CGAATGGCaA TAAATATTTA GATGGTTATG 720

CATCGTTGTG GGTCAATGTG CATGGTCATA ATAACAAATA CTTnAATAAG GTAATTAAAA 780

AGCAACTCAA TAAAATTGCC CATTCTACGC TGCTAGGATC ATCAAATATT CCGTCAATAG 840

AACTTGCGGA AAAATTAATC GAAATCACGC CAAGTAATCT AAGAAAAGTA TTTTATTCTG 900

ATACAGGCAG TGCGTCTGTT GAAATCGCAA TAAAGATGGC ATATCAGTAT TGGAAAAATA 960

TTGATAGAGA AAAATATGCC AAGAAAAACA AGTTTATAAC GCTAAATCAC GGTTATCATG 1020

GGGATACGAT TGGTGCGGTA AGTGTTGGTG GTATCAAGAC CTTTCATAAA ATATTTAAAG 1080

ACTTAATATT TGAGAATATT CAAGTAGAAA GCCCATCTTT CTATOGCAGT AATTACGATA 1140

CTGAAAATGA AATGATGACA GCTATTTTAA CGAATATAGA GCAAATTCTA ATTGAAAGAA 1200

ATGATGAAAT CGCAGGGTTT ATATTGGAAC CGTTGATTCA AGGTGCGACA GGCTTGTTTG 1260

TTCATCCTAA AGGCTTTTTG AAAGAAGTCG AGAAATTGTG CAAAAAATAC GATGTCTTAT 1320

TAATTTGTGA TGAGGTAGCA GTTGGTTTTG GGAGAACTGG AAAGATGTTT GCATGCAATC 1380

ATGAAGATGT TCAACCGGAT ATTATGTGTT TAGGTAAGGC GATTACTGGT GGCTACTTAC 1440

CACTTGCAGC TACATTGACA TCTAAAAAAA TATACAATGC ATTTTTAAGT GATTCGCATG 1500

GTGTGAATAC CTTTTTCCAT GGTCaTACAT ACaCCGGAAA TCAAATcGTT TGTaCGGTTG 1560

CATTaGaAAA TATaAGaCTT TATGaAAAAC GTaAGTtnAT TGTgCACATa TTGaAACGaC 1620

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(2) INFORMATION FOR SEQ ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 431 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 



ATGATTTTtA aAAATCATTA AGTTAAGGTr GATACACATC TTGTCATATG ATCAAATGGT 60 

TTCGCCAAAA ATCAATAATC AGACAACAAA ATGTGCGAAC TCGATATTTT ACACGACTCT 120 

CTTTACCAAT TCTGCCCCGA ATTACACTTA AAACGACTCA ACAGCTTAAC GTTGGCTTGC 180 

CACGQnTTAC TTGACTGTAA AACTCTCACT CTTACCGAAC TTGGCOGTAA CCTGCCAACC 240 

AAAGCGAGAA CAAAACATAA CATCAAACGA ATCGACCGAT TGTTAGGTAA TCGTCACCTC 300 

CACAAAGAGC GACTCGCTGT ATACCGTTGG CATGCTAGCT TTATCTGTTC GGGCAATACG 360 



aTGCCCATTG TACTTGTTGA CTGGTCTGat ATCcGTGAGC AAAAACGGCT TATGGTnTTG 420

CGAGCTTCAG T 431

(2) INFORMATION FOR SEQ ID NO: 378: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2006 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 



TTTnTATAAC GTATTATAAA TCGTTAAAAA TTTTGGTTGT GTTTGCGTCA CGTAGACAAC 60

CTCCATAAAG TTACTTAATC ACTCTCATCA TACAATAATT TTTACTCAAA TTGGAAnAAT 120

TATAAAAATT AAATATAGAT AGGCTTTGAA AATTAGTTTT ATACAAGGTT AGTAGCTGTA 180

ACTGTAAAAT GTTCTTAATA TTGTCAAAAT GTAATGCTTG AAAGCGCTTT TAAAaAATAT 240

TATTATATAC ATGGTTAGAC AAATAGACAA ATCACTATAC AAATATTGGG AGGAATATTT 300

TATGAAATCA ACACCACACA TTAAACCAAT GAATGACGTC GAAATTGCAG AAACGGTTCT 360

ATTGCCAGGA GATCCGTTAA GAGCTAAGTT CATTGCAGAA ACTTATTTGG ATGATGTGGA 420

ACAGTTCAAT ACAGTGCGAA ACATGTTTGG TTTTACCGGA ACATATAAAG GTAAAAAAGT 480

TTCTGTCATG GGTTCAGGTA TGkGTATGCC ATCTATTGGC ATTTACTCTT ATGAATTAAT 540
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CATTGATTTA 
TATOATGTGA TTaTTkCACA AGGTGCCTCT ACTGATTCAA ATTACGTTCA 660

ACAATATCAA TTACCAGGTC ATTTTGCGCC AATTGCTTCT TATCAATTAT TAGAAAAAGC 720

AGTTGAAACA GCACGTGACA AAGGTGTACG TCATCATGTA GGTAATGTGT TATCAAGTGA 780

TATTTTCTAT AACGCGK3ATA CAACAGCGAG TGAACGTTGG ATGCGTATGG GTATTTTAGG 840

TGTAGAAATG GAATCaGCTG CaTTATACAT GaATGCAaTT TACGCTGGTG TCGAAGCATT 900

AGGTGTGTTC ACAGTGAGCG ATCATTTAAT TCATGAAACG TCAACAAcAC CTGAGGAAAG 960

GGAACGTGCA TTTaCAGATA TGATTGAAAT TGCACTGTCA TTGGTGTAGA TGATTATGAA 1020

TGTTGAATAT TCTAAAATAA AGAAAGCAGT ACCTATTTTA TTATTCTTAT TTGTATTCAG 1080

TTTGGTTATA GACAACTCAT TTAAATTGAT TTCTGTAGCC ATTGCTGATG ACTTAAACAT 1140

ATCTGTAACG ACAGTAAGTT GGCAAGCGAC ATTAGCCGGT TTAGTAATTG GTATTGGCGC 1200

TGTAGTATAC GCTTCATTAT CTGATGCCAT TAGTATACGC ACACTATTTA TTTATGGCGT 1260

GATATTAATC ATTATCGGAT CAATTATTGG TTACATTTTC CAACATCAAT TCCCATTACT 1320

TTTAGTTGQA CGTATTATTC AAACTGCCGG TTTAGCTGCT GCAGAGACAT TATATGTGAT 1380

ATAT0TTGCA AAGTATCTTT CTAAAGAGGA CCAGAAGACT TACCTTGGCT TAAGTACGAG 1440

CAGTTATTCC TTGTCATTAG TTATCGGTAC ATTATCAGGT GGATTTATTT CTACGTATTT 1500

ACACTGGACA AATATGTTTT TAATTGCATT AATCGTAGTA TTTACGTTGC CATTCCTATT 1560

TAAATTATTA CCAAAAGAAA ATAATACGAA TAAAGCTCAT TTAGATTTTG TTGGCTTAAT 1620

TCTAGTGGCA ACTATTGCTA CAACAGTCAT GCTGTTTATT ACGAACTTTA ATTGGTTATA 1680

TATGATTGGT GCCTTAATTG CGATTATCGT TTTTGCGCTA TATATTAAAA ATGCGCAACG 1740

TCCATTAGTA AATAAATCAT TTTTCCAAAA TAAACGTTAT GCTTCATTTT TATTTATAGT 1800

ATTTGTAATG TATGCTATCC AATTGGGTTA TATTTTTACG TTCCCATTCA TAATGGAGCA 1860

AATTTATCAT CTGcAACTAG ACACAACATC ACTGTTATTA GTACCGGGgT TaTATAGTAG 1920

CAGTCATTGT TGGtGgCACT AAGTGGgTtA AAATCGGgCG rAATATCTGG AATTCCAAAA 1980

CCAAGCGGAT TATCACAGCC AATTAA 2006

(2) INFORMATION FOR SEQ ID NO: 379:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4858 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear

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5 






TGGGGAAAAA 


AAGACCCAGC 


AGTATTAGAA 


AATCGTATCA 


TAAAATTAGT 


CGAAGAATTA 


ATTTCTTCTG 


AAGCACAGAC 


CGTGCATATT 


TTAAAACAAT 


TGCATCCTGA 


TTATCAATTT 


ATTAAAATGA 


AACCTCATCA 


ATTCGAACAA 

*» * * ^» VJ*A#* 


AAATATGATG 


TGAAGAATAA 


GAAAATTAAA 


ATAATTGAAA 


TT ACAGAT CA 


TGGAATTGGT 


GATCGCTTTT 


ATCcrAGTGGA 


TAAATf^TfYST 
X nrlrl X w X X 


TTATCTATTG 


CTf*!AJVAAAAT 


uii XW*rwvl in 


ATTAACAAAfi 


fiAAHAArVyTT 


V A & A ZV Tf A T & 




x\r\i. X X X X liui 


O x AwitATTA 


AAf*AAATAAA 


*-Av» i XV 1 


TTAT CATTTC 


PATPTATH A 21 


V_\_ X AuAAuAA 


TCAACGCTTT 


fVT A A HTPT A f 




AAt 1AAATAU 


• iTkT|Miwiy«f| y^/^ ^ 
X X X X Avj luuA 


n/y^Tnn* a & a a 
V^Va\j i V> 1 AAAA 


a a a /"*/*^ a 
I vi A LTAAlGA 


^. X X V*V?V> 1 VjV_lj 


AATTTAGGTC 


*t*b a ^v^hph 1 ' a f a 
TAATGllAUA 




pbrp rtttbp 
VaiVWjA ill /\V-_ 


AAGTCTTATT 


ACPAfirTTT A 


A A & ArtTAOA & 


fST*a htp a a a a 


TTTAATTGAT 

A X XAAX X VJX^X 


nnliil XLUVila 


A Zl T A CnTlTlf^ 


TCCAAAAAGG 


A A ACLATfyT A r* 


fTA & ATTAf2 A 
V-» X Ann X X nun 


TTTGGAAGGT 


ATATCAGCAG 


fiAATTftTTTT 
vjrm x x v3 x x x 


TTATGAAGCA 


ATTTTAAGAA 


TGGAAAAACG 


ACATATTAAT 


AACAATACCG 


AACGTGGTTG 


AGTAACGTTA 


AATAATAACG 


AAAAAAGGGC 


GTTTGAAAAA 


TATCTTCATA 


AAAACTTCGT 


AGACGCACTT 


GTACCGATGT 


TACAACGTAC 


AAATATACAA 


ATAGGCATGG 


CTCACCGTGG 


AAAACCGTAC 


GAAATGATGA 


TTTCAGAATT 


TGAAGATGGT 


AGCTTGCAGT 


TAACTGCTGG 
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GAATGQTTAA 


ATATTTCTAT TGAAOAAATG 


60 


fTTTOAATTYSA 
w x x unn x xtxrl 


CTAAAGGAGA 


TGTAAATGAC 


120 


AATTI ATY2 A A A 
iinivjnluivvi 


TTCGCTCGCG 


AATACACTCA 


180


un x *\v_uvxa l V* 


TGACATCTAA AAATCTAGAA 


240 


1 1A1 1 1 IIAA 


TCTTTATTGA 


TAATGCAATC 


300 


1 AA£*A<-AA 


GGTTAAAAAA 


TAAGCAAAAA 


360 


ATTCCAGAGG 


AAGATCAAGA 


TTTCATTTTT 


420 


TCAAGAAGTC 


AAGGCGGTAA TGOACTCGGA 


480 


AACGGAGGAT 


CGATTAAAAT 


TAAAAGTGAA 


540 


TTTTAATCAT 


GACTGAGACG 


TCAATCAAAG 


600 


GCTGTGACTA 


ATGTATAAGA 


AGAACTATAA 


660 


TGTTGTACTC 


CCAAAATTTA 


CAATAAAATA 


720 


TGTTGATTCT 


TCTTTTTAGC 


AGATAAATAG 


780 


TGATATTATT 


nUil IVixAAl? 


CQl 1 1CAACA 


840 


AAGAAAAGAA 


Villi L</UjiU3U 


v- 1 v_Vw x\j 1 AAA 


900 


TCTATATGAT 


Un^l XXX InL 


AHflMWlTr 
MnVvilA.AJAl ^. 


960 


CAGCACAATT 


AAGAATGATG 


ACTCAATTGT 


1020 


TAGCGACGGC 


ACAATTAAGC 


GTGTCATGCG 


1080 


TCTTAAAG CG 


GATATTTATC 


CTGTAAATCC 


1140 


GATTGAAGAC 


TTTGATTTAG 


ATCAACAGAC 


1200 


AGATCACTTT 


GCCGACATTT 


ATGATAATGC 


1260 


n*iia r^a a a a 

1 lALAAAbi>A 


CCAATTGCAT 


TTGAGTATAC 


1320 


flTTAA A A AO a 
vl InnnnnvjA 


AGAATTGAAA 


CGCCATATAA 


1380 


A PTR TTP A & a 


CAATTAGCGT 


ATGTTGAAGG 


1440 


TGGTGCAAAG 


CX3TTTTTCAA 


TTGAAGGGGT 


1500 


TATTACGATT 


GCTGCGAAAG 


AAGGTATTAA 


1560 


ACGTTTAAAC 


GTTTTAACGC 


ATGTCTTAGA 


1620 


TATGCATACA 


GATCCAATGA 


AATTCTTACC 


1680 


ATGGACTGGT 


GATGTGAAAT 


ATCACCTTGG 


1740 
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AAGTCACTTG GAAATTGTTG CACCTOTTOT TGAGOGGCGT ACGAGAGCAG CACAAGATGA I860 

TACACAACOA GCTGGGGCTC CGACGACTGA TCATCATAAA GCAATGCCAA TTATTATACA 1920

TGGCGATGCT GCTTATCCTG GTCAAGGAAT TAACTTCGAA ACAATGAACT TAGGAAACTT 1980 

GAAAGGCTAT TCTACGGGTG GTTCATTGCA TATTATTACT AACAATAGAA TTGGATTTAC 2040 

TACAGAACCA ATTGATGCAC GTTCAACAAC TTATTCTACA GATGTGGCCA AAGGTTATGA 2100 

TGTGCCAATA TTCCATGTCA ATGCAGATGA CGTTGAAGCT ACTATTGAAG CAATTGATAT 2160 

TGCAATGGAA TTTAGAAAAG AGTTTCATAA AGACGTCGTT ATTGATTTAG TAGGTTATCG 2220

TCGTTTCGGA CATAACGAAA TGGATGAACC ATCAATTACT AATCCaGTTC CTTATCAGAA 2280 

TATTCGCAAA CATGACTCTG TTGAATATGT GTTTGGTAAA AAGCTTGTTA ATGAAGGTGT 2340 

CATTTCAGAA GATGAAATGC ATTCATTTAT AGAACAAGTC CAAAAGGAAC TAAGACAAGC 2400 

TCATGATAAA ATTAATAAAG CTGATAAAAT GGATAATCCA GATATGGAAA AGCCTGGAGA 2460

TCTTGCATTA CCGTTACAAG CAGACGAACA ATCATTTACT TTTGATCACT TGAAAGAAAT 2520 

AAATGATGCA TTGTTAACAT ATCCGGATGG CTTTAACATT TTGAAAAAGT TAAACAAAGT 2580 

TCTTGAGAAG CGTCATGAGC CGTTTAATAA AGAAGATGGT TTAGTTGATT GGGCACAAGC 2640 

AGAAGAACTT GCATTTGCGA CAATTTTACA AGATGGTACA CCGATTCGCT TAACTGGTCA 2700 

AGATAGTGAA CGTGGTACAT TCAGTCATAG GCATGCCGTG TTACATGATG AGCAAACAGG 2760

TGAAACATAT ACACCTTTAC ATCATGTTCC TGATCAAAAA GCGACATTTG ATATACACAA 2820 

TTCTCCGCTT TCAGAAGCAG CAGTAGTTGG TTTTGAATAC GGCTATAATG TGGAAAACAA 2880 

AAAAAGCTTC AATATTTGGG AAGCACAATA TGGTGATTTT GCAAATATGT CACAAATGAT 2940 

TTTTGACAAC TTCTTATTCA GTTCTCGCTC AAAATGGGGA GAACGTTCAG GATTAACATT 3000 

ATTCTTACCT CATGCATATG AGGGTCAAGG GCCTGAACAT TCATCAGCAA GATTAGAGCG 3060

ATTTTTACAA TTAGCTGCTG AAAATAATTG CACAGTTGTC AACTTATCTA GTTCAAGTAA 3120

TTATTTCCAC TTATTGCGTG CACAAGCGGC TAGTTTAGAT TCTGAACAAA TGCGACCATT 3180 

GGTTGTTATG TCACCAAAAA GCTTACTGAG AAATAAAACA GTTGCAAAAC CAATTGATGA 3240 


ATTTACTTCT GGTGGATTTG AGCCAATTTT GACAGAATCA TATCAAGCGG ATAAGGTTAC 3300 

AAAAGTTATT TTGGCAACTG GTAAAATGTT CATTGATTTA AAAGAAGCAT TAGCTAAAAA 3360 

TCCAGACGAA TCAGTATTAC TCGTTGCGAT TGAAAGATTG TATCCATTCC CAGAGGAAGA 3420 


GATTGAAGCA TTACTAGCAC AATTGCCAAA CCTTGAAGAA GTGTCATGGG TACAAGAAGA 3480 

ACCTAAAAAT CAAGGTGCAT GGTTATATGT CTATCCATAT GTTAAAGTGC TAGTTGCAGA 3540
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AGAAATTCAT AAACTTGTTC AAAATAAAAT TATAGAAAAT GCATTAAAAA ATAACTAGGG 3660 

GGAAATAAGT CATGCCAGAG GTTAAAGTTC CAGAATTAGC AGAATCTATT ACAGAAGGTA 3720 

CCATTGCAGA ATGGTTGAAA AACGTAGGGG ATAGCGTAGA AAAAGGTGAA GCTATTCTTG 3780 

AATTAGAAAC TGATAAAGTT AATGTCGAAG TTGTATCTGA AGAAGCAGGT GTATTATCTG 3840 

AACAACTTGC AAGTGAAGGC GACACTGTAG AAGTTGGACA rGCAATTGCT ATCATCGGCG 3900 

AAGGTAGTGG CAATGCTTCT AAAGAAAATA GTAACGACAA TACTCCACAA CAAAATGAAG 3960 

AAACAAATAA TAAAAAAGAA GAAACAACAA ATAATTCGGT AGATAAAGCT GAAGTAAATC 4020 

AAGCAAATGA TGACAATCAG CAACGTATTA ATGctACGCC TTCTGCGCGT CGATATGCTC 4080

GTGAAAATGG TGTGAATCTT GCTGAAGTAA GTCCGAAAAC AAATGATGTG GTTCGTAAAG 4140 

AAGATATTGA TAAGAAACAA CAGGCACCGG CATCAACACA AACAACACAA CAAGCATCTG 4200 

CAAAAGAAGA GAAAAAATAC AATCAATATC CTACAAAACC AGTGATTOGT GAAAAAATGT 4260

CACGTAGAAA GAAAACAGCT GCCAAAAAAT TATTAGAGGT ATCTAATAAT ACAGCTATOT 4320 

TAACAACATT TAACGAAtTG AcATGACAAA TGTTATGGAA TTGCGTAAAC GTAAGAAAGA 4380

ACAATTTATG AAAGATCATG ATGGTACTAA ATTAGGATTT ATGTCATTCT TTACTAAAGC 4440 

TTCTGTAGCA GCTTTGAAAA AGTATCCAGA AGTTAATGCA GAAATCGACG GCGACGACAT 4500 

GATTACGAAA CAATATTATG ATATTGGTGT AGCTGTTTCT ACAGATGATG GATTATTAGT 4560 

ACCATTTGTA AGAGATTGTG ATAAAAAGAA TTTTGCAGAA ATCGAAGCAG AAATTGcTAA 4620 

TTTAGCAGTT AAAGCaCGAG AGAAAAAACT TGGCTTAGAT GATATGGTTA ATGGTTCATT 4680

TACGATTACA AATGGOGGTA TTTTTGGATC AATGATGAGT ACGCCAATTA TCAATGGTAA 4740

TCAAGCTGCA ATCTTAGGCA TGCATTCAAT TATTACAAGA CCAATTGCGA TTGATCAAGA 4800

TACAATCGAA AATCGTCCAA TGATGTATAT TGCATTAAGC TATGATCATA GAATTATT 4858
(2) INFORMATION FOR SEQ ID NO: 380:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2222 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS : double

(D) TOPOLOGY: linear






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380: 
ATCAGTCACA CGGTAGGCAT ATAAAATGAG TCGTTTCTAC AACATTTTTA AACAGTTCAT 60 
TCAATATTAT TTTTATCTAA TAATnATATT GGGAGGATTA TACCTTTATA CACACCATGC 120
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ATG CTATTTA 


GCTAAAGCTA AAAGACCAGA CACTATGCAT ATTTCAACTG GAAATATGTG 240

GCGATACTTA GTTGCAATTA TTGCCTGTAT GATTTGGTAC CTTAATAAAG CGCATGTAAG 300

TATCATCGGT ATAATTATTG GTTTAATGAT TTCATATGTT GTAGTTATCA TACGTCCTTT 360

ACTAAAGGTG AGCAAATAAA TTAAGAAAGA GGTGAGATTA TGGATCACAA ATCCCCGCTC 420

GTGAGTTGGA ATTTATTCGG TTTTGaTATC GTTTTCAATT TATCAAGTAT ATTGATGATA 480

CTTGTTACGG cGTTTCTTGT TTTTCTACTT GcTATCATTT GTACGCGTAA TTTGAAAAAA 540

AGACCAACTG GGAAACAAAA TTTCGTTGAA TGGATTTTTG ATTTCGTGAG GGGAATCATT 600

GAAGGTAACA TGGCTTGGAA AAAAGGTGGT CAATTCCACT TCTTAGCAGT AACGCTGaTT 660

CTGTACATTT TTATAGCTAA TATGTTAGGT CTTCCGTTTT CTATAGTAAC GAAAGATCAC 720


ACATTGTGGT 


GGAAATCACC 


GAGAGCnGAT 


GCAACAGTGA 


PTTTJV&OrtTT 
vi i xnn\«\j x x 


GTCTACAACG 


780 


ATAATACTGT 


TAACTCACTT 


TTATGGAATT 


AAAATGCGTG 


GTACX5AAACA 


A VAT PTTJVAA 


840 


GGTTATGTAC 


AGCCGTTTTG 


GCCATTGGCA 


ATTATTAATG 


TTTT'TY^ & Af3 A 
x x X X iwviun 


£l'I"I'f A f^' r- j'f* A 

X X UlW X X ^n-n 


900 


ACATTAACGC 


TTGGTCTGCG 


TTTGTACGGT 


AACATATTTG 


r , &.nn r mzLC2Jx r v 

Wi*J<J X\JT\\3-T\ X 


APTATTAAfA 


960 


TTACTTGCTG 


GCTTATTCTT 


TAACGAACCA 


GCATGGGGTT 


ounl XJXX Inu 


TATPPPAfSGA 


1020 


TTAATTGTTT 


GGCAAGCATT 


TTCAATATTT 


GTAGGAACAA 


TC^C^WCiC x\Tl\ 


■P A T PTTT & TT 

X f\ T\ X X 


108 0 


ATGCTTTCGA 


TGGTTTATAT 


GTCACATAAA 


GTGGCAGATG 


A A C Jx PT ZV A A A 


ATTTCAATAA 

n xxx cnn X ftn 


1140 


TTATATACAA 


TCACAGGAGG 


AAATTAAATT 


ATGAATTTAA 




AATfTiPAATT 


1200 


GGTTTATCAG 


CATTAGGAGC 


AGGTATCGGT 


AACGGTTTAA 


*T* P*f2 TTT P* Ji A 




126 0 


GGTGTAGCAC 


GTCAACCAGA 


AGCACGTGGT 


CAATTAATGG 




CATTGGTGTA 


1320 


GGTTTAGTTG AGGCATTACC TATCATCGGT GTAGTAATTG CATTCATGAC ATTTGCTGGA 1380

TAATTAACAG ATAAAAGAGG TCGGGACAAA GCGCATAGGA CATAATTCAT GATGCATATA 1440

TAGTAATATC TTTGAACTTT ATTAAATAGT TGAGATATGA ACGCACCATG CCTATCGCAT 1500

AAATTCAGTA GGTCCTAACC TCGTCGTTTT TTTCTATATA ACACTAGCGA TTATTTTAAT 1560

GAAAGGAGTG TCATGAACCC GTGACTGAAA CAGCTAACTT ATTCGTTCTT GGTGCAGCTG 1620

GAGGCGTTGA GTGGGGTACT GTGATTGTAC AGGTCCTAAC TTTCATCGTG TTAcTTGCGT 1680

TACTTAAAAA GTTCGCATGG GGTCCATTGA AAGATGTAAT GGATAAACGT GAAAGAGATA 1740

TTAACAGAGA TATCGATGAC GCAGAACAAG CTAAGTTAAA TGCACAGAAA CTTGAAGAAG 1800

AAAATAAACA AAAACTTAAA GAAACACAAG AAGAAGTTCA AAAGATTTTA GAAGATGCTA 1860

AGGTTCAAGC ACGTCAACAG CAAGAACAAA TTATTCaTGA AGCAAACGTA CGTGCAAACG 1920






EP0 786 519 A2 

TTAATAATCa AGTATCTGAA CTATCAGTGT TAATTGCTTC TAAAGTTCTT AGAAAAGAAA 2040 

TTTCTGAACA AGACCAAAAA GCATTGGTTG ACAAGTATCT AAAAGAGGCA GGCGATAAAT 2100 

AATGGTAAAA GTAGCTAACA AGTATgcTAA AGCATTATTT GACGTGTCAT TAGATACAAA 2160 

TAATTTAGAG ACTATTAATG AAGAATTAAC AGTTATAAAT GAAGCAGTAA AAGATAAAAT 2220 

TG 2222 



(2) INFORMATION FOR SEQ ID NO: 381: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1143 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381: 





TAACAAiiCC TaTATTCATG TTTAATACGA AACACTACAT TTACATTGTA ATTCACTATC 60

TTTTGAAGTA ATAAAuTuAT TTGTTCAATC GATAGCTCAT TGCTTGTGTC GATTGTAACA 120

ATTATATGCA AGTTTTCAGG ATTAACACCT AATCTTTGAA TGATTTGTTT AATAGTATAA 180

TAATCCATCC AATAAAAATT ACTTCCTTTA ATATAAATGT TTTTAGGTTG ATACATTTGA 240

CTCCTTTTTG TAGGCTCAAA AGGTATATCA ATCTCGCGCA TACTTGAAGA ACTTTGATTA 300

GTATCATCAA ATAATTCAAT TATATTTTTA TCAATTTCAA CTAATTGGGA ATGGTTAATT 360

GAATGAGATG TTGGACTATA TCTTTTTCTA ATTAATTTAG GAGTGTTACC ATACGTTTCT 420

TTAAAGAGGT GTATAAAACG AGAATAATGA TTAAAACCAT GACTACTTGC GATTTCTTCA 480

ATAGGCTTCT TAGrAGTTAA AATATCAATT AAGCAATGCT CCAGTCTAAT ATGATTTAAA 540

TATTGAATAA AATTACTATA AGGTGTCGCT TTAAACATGT CACTTAAAGC TTTGTTTGTA 600

ATACTAACTT GATTAATGAC ATCTTTCCTA TTTATCTTTT TATGGTGGTT GTTTGTTAAA 660

TAATCGTGCA CTTCTTCGGC TACTAAATGA CGACTACCAT CGTATGTATT TAATGACATT 720

AATTCAACAC ACATGTAATT AATAATCTTA TCATTAGCAT TATAAGACTG TTGTTTAATT 780

TTGCTGTAGA TTAAATACTT AATCAAGATT CTAAAAGTGC TAGCAACCTC ATTTGTTAAT 840

ATCTTGCCAC GCATCATATA GTTTGCATTC ATAAACTTCA AAAATGTCTT TGCACTTACT 900

TCAACAACAC AACATACACT ATCATCATTA CCATCAATTT GATACAAATC ATTCAACATA 960

ATAATGGTGA CATCATTTCT TTTTACATCA TATTGTTGTA GATTGATTGA AACTACACCT 1020

GTACCTTGTA GCCAATATAT AATTTTtAAG TTCGCTTCCT TCACACTACT CATTTTCATA 1080
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ATT 1143 
(2) INFORMATION FOR SEQ ID NO: 382: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 506 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382:

GGACCACCTT TTTTACCTGT ACGACTAAAO CCAACTGTTT TAAAGGCACC AACAGCATGC 60

CACACTTGAA TAATTTCTTC TGATGGTCTA AAGCGAACCG TATAAATTAA TGGATGGAAA 120 

TCATCAACAA AAATATAATC GGCCTTACCA AGTAAATATG GCAATCTAAA CTTGTCTCTC 180 

CATTTGCGTC TATCCGTAAT ATTCTCCTTA AATACCGTTT TAATATCATA ATCAAAATCT 240 

ACTTTTTGGC GTAGTAACTC ATCATATACA TACTTGAAAT TCCCTGATAA ATTCGGACGC 300 

GAATCTGATG TGAATAATAT TGTTTTGCTT CTTTTAATAT GTAGTAACTT TGTAATATTA 360 

AAAATAGCTT TAAATAAGAA ACTTCTACTT TCAAATGAAG CTTTATGaCC TTGTTTaTGA 420

AGCCAGTGTG cACTTgTCGC AATGaCCCCT GaTTTCyCyT GagGtAAGGk GaTTTCmATA 480

TCAAATACAA ATTCGTTAAC GTCACT 506 
(2) INFORMATION FOR SEQ ID NO: 383: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 421 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383:

AGCATCTGCA AtGAGTcTCT AATACAACGA TACGTTTTGC ATCTTTAGGT ACTTTTACTG 60 

TACCATTTTC ATCTTTTACC GAAATAGTAT CTTTAGTTGA TGATTCTTTT TTACTTGAAT 120 

TATCCGTATT ACCACAAGCT GCAACTAAAA GTAAGGCAAC TATTAATCCC AATATACTAA 180

AAGTTTTTAG ACCTCTCATC GTTCCACTCC TTAATATGTA TAACTTCATT TATTATTTTA 240 

TTGATAACAA TTATCATTGT CAAGTAGCGT TCAATCTTTT TTATATTTCT AAAATGTATG 300 

ACTATATATT TCCTCTAATA ATTATGACTA CAATTAGCAC ATTTCCTTAG ACAAAATACT 360

GATAATGTAT CATTGCTATA TCATCTTTGC ATTAATACAA TTGACACCAT TTAGCATGAC 420
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<2) INFORMATION FOR SEQ ID NO: 384: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 862 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384:






X\tv^\V~ X J. Van X 


nS* x 1 iAuLnw 


ATvjAAAI 1\»C 


TAATwGATTA 


GTTGCAAATG 


ACAAGAATGA 


$0 


AUUvi^ X X 1 vj 




ATAAAATGGC 


AACGATTCGT 


TTTACAGAAC 


CTACGCTGAT 


120 


TUCAI xAUCA 


GGGGGTAATG 


TCAAAGCTTA 


CACTGAGCAT 


ATGACTATAT 


CTCCATATAA 


180 


ATTGTATTTG TTAGATAAAG GCGATGTTTT AAAGTTTAGA GAAACAAGTT ATACATCGOG 240

AGTGTATTTA GCTGTgGGAG GCGGATTTGA ATTAGATGCA TGGTTAGGAT CTAACTCAAC 300

CGACTTTAAT GTAAAAATTG GTGGTTTTAA AGGTAGAACA TTACAAGATG GCGATGAAAT 360

AAAGCTTAAG AGAGATTATA CAGCTCGTCA TCATAAGTTA TTTGAAAACC TTGCTCACAC 420

GAAACAAACA GATTGGGGTA TTGATGGATA CGCCTTGTCA TTTAATTATA TGTCTGATGT 480

ATTTCATGTC GTTAAAAATA AAGGTACGGA AGATTTTAAA GAAGATGCCA TTCAAAGATT 540

TGTGAAACAT GATTATAAAG TAACGAGCAA AGCAAATCGC ATGGGGATGA TGCTTGAAGG 600

TGAAAAAATC aaagctittt ATGAAGATAT GCCACCGTAT CAGACTGTCA AAAAAGGAAC 660

GATACAAATT AAGCGTGATG GCACACCTAT TATCCTATTA AATGATCATT ATACGCTAGG 720

TAGCTACCCG CAAATCGGTA CAATCGCAAG TTATCATTTA ACGAAATTAG CACAAAAACC 780

GCAAGGATCA CGTTTGAAAT TTCAATTTAT AGATATTTTA ACGGCTGAAA AGAACCTTGT 840

TAAgTATAGT AACTGGTTAA AC 862



(2) INFORMATION FOR SEQ ID NO: 385:



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1027 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 
AAATGAGTTG ATGATGGCTC CTATTCCATC TATCTCTAGG TATGACAGAA SATAAACGGT 60 
CTTCTACCTG TCTsACATTA TCTTTCCAAC GATTAATACC TAAGCGTTTA GAAACACGTT 120 
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CAOTTTTACG TCCTACACCT GCTAAACTTT CTAATTCCTT ATGTGTTTGT GGTATTTCTC 240 

CATTAAATTG ATCAATCAAA GATTGACAAA GTTTCTTAAT ATTCTTAGCT TTGTTACGAT 300

ACAGACCGAT AGAACGAATA TCATTCATAA GTTCTTCATC ACTGACTGCC AAATAATCTT 360 

CAGGCGTTTT GTATTTTTTA AACAGCTCAG TTGTTACTCT ATTTACTAGA ACGTCTGTAC 420 

ATTGCGCTGA CAATAATACA GCAATAGTTA ATTCGAACGG ATTATCATGT TTTAATTCAC 480

ATTCTGCATC CGGAAACATA TTTGCTATAA CATCAATCAT TTCTAATGCT TTTTTCTTAC 540 

TTACCATCAA GGTTCTCCCC GTTTAACCAA TCAAATTTAG GTACCGTTTT AACTGTGTGC 600 

GTCATTTTCG GTTTATTGAA TTTTTCTCTT ATTTTTCTAG AATCGTCAAT TGTTTTGACA 660 

TTGTTTTTCT TCCAATTAAG TAAAATACGA TCTATATATT TAAAGCTAAG TTTATTCAAA 720 

CTATTCGCCT CGTCTAATGC CGCTTGTATA ATTGCAGTAT CGTGTTTATC AACATCAATC 780 

CATTGATTTA ACGTTTCTAT TTCATATGGA GATAACGGCC TTGCAAATGT ATCCTCTAAA 840

ACTCTAAATA ATTGTTTAAA TTTTTCTTTA CTATTTAGCT CTTTCGTTTC CATACTTTGT 900 

TGCTTCAATA TATGACTTAA TTTTTCGAAA AAAGGATCTA GATTCATATA TTCGGkAAAT 960 

CTACCTTCyT CATCyTTTTG aACTkGtAAT tCTAGCAATT CACGTgTATC AAATTTTGGA 1020
TACCATT 1027

(2) INFORMATION FOR SEQ ID NO: 386:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1006 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386:

AAGGnTTGGA GGGAATTAAT nGATGGCAAA TCCCAGaTAA AGTACACaAC GCATGGAATG 60

TGATGCATTT AATATTTCCA GTAGTTAGTA CGACGTTTGC AAGCTTTAAA TCTATGTATG 120 

GGGGCATACC AAAAGATTTC ATAGACTACT TATTTATTGA TGAAGCAGOA CAAGCAATAC 180 

CTCAAGCAGC TGTGGGAGCA TTATATCGTT CAAAAAAAGT TGTAGCTGTA GGTGATCCGA 240

TTCAAATAGA ACCGGTTGTG ACTTTAGAAA GTCATTTAAT TGATAACATT CGTAAAAATT 300

ATCATGTTCC GGAATATCTA GTTTCTAAAG AAGCTTCTGT GCAGTCTGTT GCAGACAACG 360

CCAATCAATA TGGTTTTTGG AAATCTGATG CTACTGATAG TAATCAAAAA ACCTGGATAG 420

GCATACCTTT ATGGGTGCAC AGACGATGTT TAAAACCTAT GTTCACGATa GCTAaCCAAa 480
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GGTAXGACGT TAAAGGAAaC GCAGTTCAAA AACAATTTGT GAAAGAGCAT GGTGAAAAaG 600 

TAGTGGGATT ATTAGCTGAT GATTGGATTG AAGCAATTAA GGAAGGTAAA AATGAACCGA 660 

GCTCATTTGT AATATCGCCT TTTTCAGCAG TACAGCAACA GATTAAACGT ATGTTAAAGC 720 

AACAACTACC GACTAGAATT GATATTGAAC GTACAAAAAT TAATCAATGG GTCGATAAAT 780 

CCATTGGTAC TGTTCATACT TTTCAAGGTA AAGAGGCTCA GAAGGTGTAT TTTGTAATAG 840 

GTACTGATAA TACCCAAGAT GGTGCTGTGA ACTGGTCATG CGAAAAACCA AACTTGTTAA 900 

ACGTTGCAGT GACAAGAGCT AAGAAAGAGT TTaTGTAATT GGOGACATGC AAAGAATACA 960 

GATGAaACCA TTTTATGAGA CGATTTTTAn AGnAAGAAAT GTAAAA 1006

(2) INFORMATION FOR SEQ ID NO: 387: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 662 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387:



CGTTTCATCA AGTnATCGAT CCTAATTTAC AAAATATTCC GGTTAGACTT GAAGAAGGGC 60

GTAAAATAAG AAAAGCCTTT AAACCAACTT CAAAAGATAG CGTTATATTA TCAGCAGATT 120

ATTCTCAAAT TGAATTGCGT GTATTAGCAC ACATTACACA AGATGAGAGT ATGAAAGAAG 180

CATTTATCAA CGGCGATGAT ATTCATACAG CAACTGCTAT GAAAGTATTT GGTGTAGAAG 240

CTGATCAAGT CGATAGTTTA ATGCGTCGTC AAGCAAAAGC AGTTAACTTT GGAATTGTTT 300

ATGGGATAAG TGATTATGGT TTAAGTCAAA GTTTAGGTAT TACTCGTAAA AAAGCAAAAG 360

CATTCATTGA TGATTATTTA GCTAGTTTCC CAGGTGTAAA ACAATATATG TCTGATATTG 420

TAAAAGATGC CAAAGCTTTA GGTTACGTGG AAACATTGCT ACATOGTCGA CGCTATATTC 480

CTGATATTAC GAGTCGTAAC TTTAATTTAC GCGGCTTTGC TGAAOGTACT GCTATGAATA 540

CGCCAATACA GGGCAGTGCT GCAGATATCA TTAAACTGGC AATGGTTAAA TTTGCTCAAA 600

AAATGAAAGA GACAACATAT CAAGCTAAAC TATTATTACA AGTACACGAT GAATTAATTT 660

TT 662


(2) INFORMATION FOR SEQ ID NO: 388: 









(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 669 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double



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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368: 



TCATCCAAAT TTTGGAAATT CCACATTTTA CATATCGTAA TTTTTTAGGA AACTAGTGAA 60




TATAACAAAT 


CCCTCCTCTC 


ATTTTTAAAA 


TAGATATATC 


*\\m X X 


XXI 1/\V» X mix 


120 




CTAAACTGCA ACGGTTCCTA ATACCAAAAT CCTGCCCTCT ATTTTTATCA ATTCAAGCAT 180

ACTTATTGAA AAATGTTAAC GTTTTCTTGA TAATCATTGT AAGCGCATTT ATTTTATAAA 240

CTAACGTTTG AAATATACTA CAGGAGTOAC ACGTAATGAC TCAAATTACT GAAAAAGAAT 300

TAAAAAAGAA GTATTTAGAT TTACTATCCC AAAATTTTQA TACTCCAGAA AAACTTGCAA 360

CTGAAATTAT CAATTTAGAA TCAATTTTAG AATTACCTAA AGGTACGGAA CATTTCGTCA 420

GCGATTTACA TGGTGAATAT GAAGCTTTCC AACACGTATT ACGCAACGGT TCTGGGAACG 480

TGCGAGCGAA AATCAATGAT ATTTTCAAAG AGAGACTTTC AACTAAGGAG CTTAATGACT 540

TAACTGCTCT TGTCTACTAT CCAGAAGrCm AATTtAAAAT TgATTAAAAG TGATTTCCAA 600

AATTgCGGtC mACtTAATGt CyGGtATATC ACaACmATCG aACATTTAAT TGAGTTAATT 660

AAATATTGT 669



(2) INFORMATION FOR SEQ ID NO: 389: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1249 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389:



CACATGGCTG TTAGAGATAT GAATGGCCAT GCGTTACCTT TAACAAAAGA TGGCAATTTT 60

TATCAAACGA ATGTAGATGC AAATGGTGTT AATCATGGTG GTAGTGAAAT GGTGCAAAAT 120

AAAACAGGTC ATATGAGTCA ACAArGCCAT ATGAATCAGA ACACACATGA ACCAACAGCC 180

ACACATGCAA CAAGGTCATA TGCAATCATC AAACCATCAA ATGATGAGTC CAAAAGCAAA 240

TATGCATTCA TCAAATCATC AAATGAACCA AAGTAACAAA AAAGTTTTAC CAGCTGCTGG 300

TGAAAGTATG ACATCAAGTA TTCTTACTGC AAGTATTGCC GCACTACTAT TAGTATCTGG 360

GTTATTCTTA GCATTTAGAC GACGTTCAAC AAATAAATAA ACATAATACG ATTAATAATA 420

GAAAAATCGT GTGATTATCT GaGGGAGCCT AGGACATAAA TCAATGTCCT AGGCtCnCTA 480

AtGTTATATT GGCAGTAGTT GACTGAATGA AATTGCGCTT GTAACAAGCT TTTCCATTTC 540
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5 






wVArV A X\J ivl 1 




uiul lit/11 X v» 


AGATATTTCC 


TGTGGAATAT 




ft 

WWW 


A ATfAT A KTT 






CAGTGTTTCA ATAATCAGCC 


AACAATGTGT 


r « u 


ATY*H fTa TO A 


AA\_*i\«V3 iuAL 


•1*21 *1 & TT^nTlP 
XnAunX X X Xft» 


GAAGTGGGGC 


GCTTTGGTAA 




*7 Rft 


aAAA1\. lunl 


i\±A 1 A luwil 


iuLinliinnl 


CGTTTGCTCA 


ACGAATGTCT 


X A Vj X V*W X >— 


ft4 ft 


TTCGTXTTGT 


GTATTCACTT 








^ntZT'h a anpa 

AVffV* X >VrvrlVjV— TV 


Qfin 


AAATGCATCA 


GCTTGCTTAG 


AATGATTGTC 


CTTTTTTTGA 


TAATAGCGTT 




jdU 


GACGGCAGAA GGATGGTTTG CAAACAAATG ATTTGTATAT TCACTTTCTA AATCAACACG 1020

ATAATTAATT GATGACATAG ATACGCGAGC TAGCAATATT TGATCAAGTG GATGCTTAAA 1080

TTGATCCATA CTTGAAGCGT GTTGGGCATT TGTTTGTGGA ATAACAAAGT GTCCCTTCCC 1140

TCTTGTACTC TCTACGATGC CATCTTCGGC TAACAATTTt ATAGCTTGGC GCAAAGTCAT 1200

ACQACTGGAC ATCAAAGCGC GCACAAAGTT CCTTTTCAGT AGGTAATGC 1249



(2) INFORMATION FOR SEQ ID NO: 390:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1788 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390:



AATGATGATT CATTCAAAGA AATTAACGCT CGGTATATGC TTGGTATTAC TCATTATATT 60

GATTGTAGGT TATGTCATTA TGACAAAAAC AAATGGTCGA AACGCCCAAA TTAAAGACAC 120

ATTTAATCAA ACATTAAAAT TATATCCAAC CAAAAATCTC GATGATTTTT ACGATAAAGA 180

AGGCTTTCGA GATCAAGAAT TTAAAAAGGG TGATAAAGGT ACTTGGATAG TTAATTCTGA 240

AATGGTAATC GAGCCAAAAG GTAAGGATAT GGAAACGAGA GGAATGGTGC TCTATATCAA 300

TCGCAATACT AGAACCACAA AAGGGTATTA TTTTATAAGT GAAATGACAG ATGACAGTAA 360

CGGCAGACCA AAGGATGATG AAAAAAGGTA TCCGGtAAAA ATGGAACATA ATAAAATCAT 420

ACCAACGAAG CCACTACCGA ATGACAAGTT AAAAAAAGAG ATTGAAAACT TTAAGTTCTT 480

TGTACAATAT GGCAACTTTA AAGATATTAA TGATTATAAA GATGGTGATA TTTCATATAA 540

TCCTAATGTA CCAAGTTATT CGGCAAAATA TCAATTGAAT AATGATGATT ATAATGTCCA 600

ACAGTTaAGa AAAAGATATG ATATTCCAAC CAAACAAGCG CCGAAACTAT TATTGAAAGG 660

CGATGGAGAT TTAAAAGGTT CATCCGTAGT TcTAGAAGTC TTGAATTTAC CTTTGTCGAA 720
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5 



AGGTATGAGT CAAACTGAAT ATCAAATAAA ACCTGGTAAT ATAACAAGTA ACTCTGAAGA 840

AACAAGTTCG ATATCTAAAG TGAGCTGTGA AATATAGGTA GCCATTTCAA AAAATTTAAA 900

GGTGAATTTG ATAATGTAGC TCAAGGAGAT TGGGTTAAAA AGGCGAAGAA TGAACTGGAT 960

GATATTAGTA AGAAATTAAA AAATATTCAA AGAACGGAAG TTTAATAGCT TATATGATTC 1020

TTGGAGCTAA GACAGCATGC GTTCATTCAT GCCATTATTA ATATAAGCAC CGCAACAAAA 1080

AAGCTTCTAA TGTGATACAG GAACCTCATA TTCCGTATCA TGTTAGAAGC TTTTAATGTC 1140

TAAAGAACAT CTACATTTTA TCATATTTTC TGACTTATTA AACTTTTATA TAATTAAATA 1200

TTTCTTAATT TTCCAAAATA GTGATAAATT TGTGAAATAC ATCACAAATC CCTTTATTTA 1260

TTTGGAAATT CATGTAATAT TAGACTTGTA AGAAGTTAAT AAATAGAGAG AGACGAGAGA 1320

GTTTATATAA ATACTATATA AACATTGGAG TGATGATTAT GAGAAAAGAG ATTGAAGCGC 1380

TTATTTTCTC AGACGTAAtA GCTATGATAT TTACGTGAAC ACTGGTGTAA ATCAAGGATT 1440

AATTGGTGAC ATCAAAGATG GTTACCTAAC TATTGATTCT ATGCCTTACA TTGATGCTGA 1500

GCGTTTGTAT CACTTTGCTA TGGAACGTAA ATCGTTAGTC ACTAACTAGT TCTTATTGCC 1560

AATGATTACT ACCCCTAGTC GGCGGCAATT GAAGTGTGAT TGATGTAACT TGCCCTCGTT 1620

GGTGAGCAAT TGAGGGCAGA CCCCTTTAAT TAAGTAAACC CTAACTCCCC ACAAATCTGG 1680

AACGATACTA AAAGCCACGT CCTATATTGG ATGTGGCTTT AGTCAkACTT ATATTATTTT 1740

tAAAACGATT ACCTACAAGA TTTACATATA AAATTCTATC ATGnCTGC 1788



(2) INFORMATION FOR SEQ ID NO: 391: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2407 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391:
GCAAGTTTAG TTAAACAGAT ATTAAAACCA GATCAATATA CAGATGCAAA TTCTCTATTA 60 
GAGATTAGTT TGCAAACAGG TATGTTTATT GCAGGTGGAT TATCAGGAAT ATTATATAAA 120 

ATAAATGGAT TCACTCTAAT TATAGCGATG ACTATAATGA TGTTTCTAAT CAGCATTTTT 180 
ATGTTATTTA GATTGCATGT AGATAAACCA ACTCATTCAG AGGAAGAATC AACAAATAGC 240

TTATTGCAAG AATATTTATT AGGATGGAAA TTTTTAAAAG ACAACATGAT GATATTTATT 300

TTTGGAGTTA TTTCAATCAT ACCAATGGTG TTTACAATGA TCTTTAACAT ATCATTACCA 360 

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TATGGCATTG GAGGATTATG TGCAGGTTTA ATTTCAOCTA TTCTTTCGAA GAAAATCTCA 480 

ACTAAAGTAT TQATATTTTT GTTATATTTC ATATTAGTCA TAAATTCAGC ACTATTTATT 540 

TGGATAAACT CAGCATTTTA CTTATTCATA GGATCATTTA TACTAGOCTA CTCAATTTCA 600 

TCAATCAGAA TTTATATGAA TACAGCTATA ATGAACACTG TTTCAGATAA ATATGTOGGT 660 

CGCTCATTTA CGATATGGAC GTCAATTTCA TTGTTACTGC AAAGTTTAAT TGCTCCATnT 720 

TTAGGAAGAT GGATTAATGA AATTAATGAT AAATTOGGTT TCTATATTAT ACTCATTTTA 780 

TCCTTACTCA TATTTGTaCA CTGCTGCTTG TTAACAAAAC AGACAAAATA AAATATGCAC 840 

ATAAAGAAGA GTGACCGTCA CTCTTCTTTA ACAAGCGACC ATTTATCGaT GGGCTTAGTT 900 

CTCTCTGCAC CCACACTTCA CTACTTCACT TTTTCAAATC ATTTTTTATG GTCTTAAATA 960 

AATCAGTGAG ATTTGTTGCT TCGGTAAAGT CTAGAATTAA TATCATTTCT TTAGAACCTG 1020 

GATATGGCGA AACTAATGTA TTATCTTGCA ATTTCTGCTG GGCACTTTTA GTCGCCTTGA 1080

CCAATAATCT ATTATCATAC AAACCACCTA TAACCACGCC ATCATAATAA ATAATATATT 1140 

CTCCCATCAT CTTTCTTGTC TTAACCGCGT TTGAATTCAC ATGATTTAAA AATAAATCAT 1200 

GTACATCTTT CTTAGTCGCC ATTGTAATCG CTCCTTCAGT TTTATGTTTA ATCACATTCG 1260

TATTAATGAT TCATTTCGTG TTGCTCTTAA TTTTATCTAT AATTATATGC GTAGTTAAAA 1320 

TCAAACCTAT GGAAAAGAAA ATAATGATAA CAATGTTAAG AAATATAGTT ATAAAATTAT 1380 

AGTTTGGAAG GTATGCGAAT AGCAGAAAAA TAGGTATCGC AAAAAATAAA TCCCACCAAC 1440

CTAAACTTTT TAAAGAATGC TTTAAACCTT CCATAATATC ACCTTTATAA ATTTGTCTTT 1500 

GTTATAAGAT AACTAAAAAA TCGCTTTACT GTAAAAGTAG CCAAAGAAAA TTCTGAATCA 1560 

TATTCATAAG TAGTGTATCA TTAATAATGA ACAATTTAAT ACTATAATCC TTGATCTTTG 1620 

TATTGATCAA CTTACCACAA CATTTATTTT AGACTACTCT TAGACTTCCC TTTCAAATGG 1680 

TTGCATCTAT TGAAATTCCT TTTGTATAAG TTAGGCTTTT GTGGTAATAT CATCATGCAT 1740 

AAAAAATCGA GATACTAATT ATAAAGAGGG TATAAATATA TTATGAAAGA AAATTTTTGG 1800 

AGTGAATTAC CACGTCCATT TTTTATTTTG GCGCCAATGG AAGACGTTAC AGATATCGTC 1860 

TTTCGACACG TTGTAAGTGA AGCAGCTAGA CCGGATGTGT TTTTCACTGA ATTTACAAAT 1920

ACTGAAAGCT TTTGCCACCC TGAAGGCATA CATAGTGTGC GCGGACGCTT AACTTTTAGT 1980 

GAAGATGAAC AGCCGATGGT CGCTCATATA TGGGGAGATA AGCCAGAACA GTTCCGTGAA 2040 

ACGAGTATTC AATTAGCTAA AATGGGCTTT AAAGGCATAG ACTTAAATAT GGGATGTCCT 2100

GTAGCAAATG TTGCTAAAAA GGGTAAGGGT TCCGGCTTAA TCTTAAGACC TGACGTTGCT 2160 



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GGCTACTATG AAATCGATGA ATGGAAAGAT TGGTTGAAGC ACGTCTTCOA ACAAGACATT 2280 

GCCAATTTAT CTATTCATCT TCGTACACGT AAAGAAATGA GTAAAGTAGA TGCACATTGG 2340

GAATTAATCG AAGCTATTAA AAATTTACGT GACGAAATTG CACCAAATAC ATTOTTAACA 2400 

ATTAACG 2407 



(2) INFORMATION FOR SEQ ID NO: 392: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2424 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392:



ATGATGnATG GCnCCGCCAA GAAGTTGAAC CAGTCTATTG GCTAGAATTG CTCAAAAAGC 60

GAGAcaTGTG GTATTCaTAT GTTaGTAgCt ACGCAAAGAC CATCTGTCAA TGTAATTACA 120

GGTTTAATTA AAGCCAACAT ACCAACAAGA ATTGCATTTA TGGTATCATC aAGTGTAGAT 180


25 














24 0 




TATCTTGGTA GCGGTATGAA TAAACCGATT AGAGTTCAAG GTACATTTGT TTCTGATGAC 300

GAAATTGATG ATGTTGTTGA TTTTATCAAA CAACAAAGAG AACCGGACTA TCTATTTGAA 360

GAAAAAGAAT TGTTGAAAAA AACACAAACA CAATCACAAG ATGaATTATT TGATGaTGTT 420

TGTGCATTTA TGGTTAATGA AGGACATATT TCAACATCAT TAATCCAAAG ACATTTCCAA 480

ATTGGGTATA ATAGAGCAGC AAGAATTATC GATCAATTAG AGCAACTCGG TTATGTTTCG 540

AGTGCTAATG GTTCAAAACC AAGGGATGTT TATGTTACGG AAGCAGATTT AAATAAAGAA 600

TAATTATGAG TAAGGAGTTT TATATAATGA CACACTATCA TTTTGTCGGA ATTAAAGGTT 660

CTGGCATGAG TTCATTAGCA CAAATCATGC ATGATTTAGG ACATGAAGTT CAAGGATCGG 720

ATATTGAGAA CTACGTATTT ACAGAAGTTG CTCTTAGAAA TAAGGGGATA AAAATATTAC 780

CATTTGATGC TAATAACATA AAAGAAGATA TGGTAGTtAT ACAAGGTAAT GCATTCGCGA 840

GTAGCCaTGA AGAAAtAGTA CGTGCACATC AATTGaAATT AGATGTTGTA AGTTATAATG 900

ArriTTTAGG ACAGATTATT GATCAATATA CTTCAGTAGC TGTAACTGGT GCACATGGTA 960

AAACTTCTAC AACAGGTTTA TTATCACATG TTATGAATGG TGATAAAAAG ACTTCATTTT 1020

TAATTGGTGA TGGCACAGGT ATGGGATTGC CTGAAAGTGA TTATTTCGCT TTTGAGGCAT 1080

GTGAATATAG ACGTCACTTT TTAAGTTATA AACCTGATTA CGCAATTATG ACAAATATTG 1140
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s 



TGGCACATAA TGTTAAAAAA GGTATTATTG CTTGGGGTGA TGATGAACAT CTACGTAAAA 1260

TTGAAGCAGA TGTTCCAATT TATTATTATG GATTTAAAGA TTCGGATGAC ATTTATGCTC 1320

AAAATATTCA AATTACGGAT AAAGGTACTG CTTTTGATGT GTATGTGGAT GGTGAGTTTT 1380

ATGATCACTT CCTGTCTCCA CAATATGGTG ACCATACAGT TTTAAATGCA TTAGCTGTAA 1440

TTGCGATTAG TTATTTAGAG AAGCTAGATG TTACAAATAT TAAAGAAGCA TTAGAAACGT 1500

TTGGTGGTGT TAAACGTCGT TTCAATGAAA CTACAATTGC AAATCAAGTT ATTGTAGATG 1560

ATTATGCACA CCATCCAAGA GAAATTAGTG CTACAATTGA AACAGCACGA AAGAAATATC 1620

CACATAAAGA AGTTGTTGCA GTATTTCAAC CACACACTTT CTCTAGAACA CAGGCATTTT 1680

TAAATGAATT TGCAOAAAGT TTAAGTAAAG CAGATCGTGT ATTCTTATGT GAAATTTTTG 1740

GATCAATTAG AGAAAATACT GGCGCATTAA CGATACAAGA TTTAATTGAT AAAATTGAAG 1800

GTGCATCGTT AATTAATGAA GATTCTATTA ATGTATTAGA ACAATTTGAT AATGCTGTTA 1860

TTTTATTTAT GGGTGCAGGT GATATTCAAA AATTACAAAA TGCATATTTA GATAAATTAG 1920

GCATGAAAAA TGCGTTTTAA TATGTTTATA ATAGAGTAGT ATGGGTATTT ATTATTAATG 1980

ACATTATTAC ATGTTAATTA GGAGGCGTTT TTAATGGATT GGATTTTACC AATTGCTGGA 2040

ATTATCGCTG CGATTGCATT CTTAATTTTA TGTATCGGTA TCGTAGCTGT ATTAAATTCT 2100

GTTAAGaAAA ACTTAGATTA TGTTGCAAAA ACACTTGACG GTGTAGAAGG TCAAGTTCAA 2160

GGTATTACTC GTGAAACAAC AGATTTACTT CATAAAGTAA ACCGTTTAAC TGAGGATATC 2220

CAAGGTAAAG TAGATCGTTT AAACTCAGTT GTAGATGCTG TTAAAGGTAT CGGTGACTCA 2280

GTACAAACGT TAAACAGCTC TGTAGATCGT GTAACAAATT CAATTACACA TAATATTTCT 2340

CAAAATGAAG ATAAAATCTC ACAAGTTGTT CAATGGTCAA ATGTTGCAAT GGAAATTGCA 2400

GACAAATGGC AAAATAGACA CTAC 2424



(2) INFORMATION FOR SEQ ID NO: 393: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393: 
CnATAATATT CnTCAAnCCT TTAAATAGAG GAATAGACTG CCGACAGAGT CCGAGACTTT 60

GTGGGTAGTT TTTTTAGTTT TGATAACGGA AGTTAGAGGC TCTCTGTCAA ATTGGGCAGA 120 

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AGTCTAAAGT 


TTTCAACATA ATACTATTAG 


TTCGGTCATG 


TATCGGACTG 


ATGGAAAAGC 


240 




GTTTCACTTT TAATGACTCA TTAAGAACGG CCTGAAAATG TTTGGCGTAT TAAGTGCAAT 300

GATAGTTTTG ACATTTAGTT TCTAATTGGT CATTACTGCC GAGCAAATCT AGTAGAGTAA 360

TCATGTAAAT CTTTAATGTG CCATTTGATT CACTAGCGGT GTTAATAACT ACGGAAATTG 420

CATTTCCGAC TGAAATTTTT GAAAAATATC AACGTACGCT ACAAATAAAA TTTTAAACTG 480

TTATAAATGT GTCTCAATTT CATATGTTCA TCGAOGATAT GAAGCGTATT ATGGTAAAAT 540

GAAGAAATAA TAAACTTGTT AATAAATAAA ACATCACGAT TTGACTAAAG CACTTTATTA 600

TTGTGTAGAT AATAGTTTTT TAACGAAATA AAAATGGCGA CTGGTTTTAA TAAATCAGCT 660

AATGAATCAC TACACCTATA AGTATGAATA TAGTGATTAG AATGCTTTGT ATAGTTGGAT 720

TTTGCAAAAT TGATGTTA 738


(2) INFORMATION FOR SEQ ID NO: 394:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1270 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394:






AAAAGTTGTA ATTAAAAGTG GGATTTTACT TAAGnnAGAA GGAAACTATT TATATGACTA 60

ATAAAAGAGn AGATGTCCGC AATATAGCAA TTATTGCTCA CGTTGACCAT GGTAAAACAA 120

CTTTAGTAGA TGAGTTGTTA AAACAATCTG GTATATTCAG AGAAAATGAA CATGTCGATG 180

AACGTGCAAT GGACTCTAAC GATATCGAAA GAGAGCGTGG AATTACGATT CTAGCCAAAA 240

ATACGGCTGT TGATTATAAA GGTACACGTA TTAATATTTT GGATACACCA GGACATGCAG 300

ACTTTGGTGG AGAAGTAGAA CGTATTATGA AAATGGTTGA TGGGGTTGTC TTAGTAGTAG 360

ATGCGTATGA AGGTACAATG CCTCAAACAC GTTTTGTACT TAAAAAAGCG CTAGAACAAA 420

ACCTGAAACC TGTTGTTGTT GTTAATAAAA TTGATAAACC ATCAGCACGT CCAGAGGGTG 480

TTGTAGATGA AGTTTTAGAT TTATTTATTG AATTAGAAGC AAACGnTGAA CAATTAGAAT 540

TCCCTGTTGT TTATGCTTCA GCAGTAAATG GTACAGCTAG CTTAGATCCT GAAAAGCAAG 600

ATGATAATTT ACAATCATTA TATGAAACAA TTATTGATTC ATGTACCAGC TCCAATTGAT 660

AACAGTGATG AGCCCATTAC AATTTCCAAG TAGCATTGTT GGACTACAAT GATTATGTTG 720

GACGTATTGG TATTGGTCGT GTATTCAGAG GTAAAATGCG TGTCGGAGAT AATGTATCAC
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GATTAAAACG TTTAGAAATT GAAGAAGCAC AAGCTGGAGA TTTAATTGCT GTTTCAGGTA 900 

TGGAAGACAT TAATGTTGGT GAAaCTGTAA CACCACATGA CCATCAAGAA GCATTGCCAG 960 

TTCTACGTAT TGATGAGCCT ACTCTTGAAA TGACATTTAA AGTTAACAAT TCTCCATTTG 1020

CTGGCCGTGA AGGTGACTTT GTAACAGCAC GTCAAATTCA AGAACGTTTA AATCAACAAT 1080 

TAGAAACAGA TGTATCTTTG AAAGTTTCTA ACACAGATTC TCCAGATACA TGGGTAGTTG 1140 

CTGGTCGCGG TGAATTGCAT TTATCAATCC TTATTGAAAA TATGCGTCGT GAAGGTTATG 1200 

AATTACAAGT TTCAAAACCA CAAGTAATTA TTAAAGAAAT AGATGGTGTA ATGTGTGAAC 1260 

CATTTGAACG 1270 

(2) INFORMATION FOR SEQ ID NO: 395: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1365 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395:



AAGACCAGGA GAACAmGTAA AACAATATAC AGTTGaAATC GCTCGTAAAT TAATGGAATT 60

TGATATAAAA TGCTCGTGAT TGCTTGTAAA TACGCAACTG CTGTnGCTTT AGAATATTTA 120

CAAAAGACCT TATCAATCCC CAGTGATTGG CGTAATTGAc CAGGTGCTAG AaCAGCAATA 180

ATGACTACTA GAAATCAAAA TGTATTAGTA CTAGGAACGG AAGGCACAAT TAAATCTGAA 240

GCATATCGTA CGCATATTAA ACGTATCAAT CCACATGTAG AGGTACATGg CGTTGCCTGT 300

CCAGGTTTTG TGCCACTTGT AGAACAAATG AGATATAGTG ATCCAACAAT TACAAGCATT 360

GTCATTCATC AAACACTGAA ACGTTGGCGT AATAGTGAGT CTGATACTGT CATTTTAGGA 420

TGTACCCACT ATCCATTGCT CTATAAACCT ATCTATGATT ATTTTGGTGG TAAAAAGACA 480

GTGATTTCGT CTGGATTAGA AACGGCTCGT GAAGTTAGTG CATTGCTAAC ATTTAGTAAT 540

GAACATGCAA GTTATACTGA ACATCCAGAT CATCGATTTT TTGCAACAGG TGATCCTACT 600

CACATTACTA ACATTATCAA AGAGTGGTTA AATTTATCTG TCAATGTGGA ACGTATATCA 660

GTGAATGACT AGGAGGATTT TTAATGAAAG AGATTGTTAT TGCATCGAAT AATCAAGGGA 720

AAATAAATGA CTTTAAAGTA ATATTTCCAG ATTACCACGT AATAGGTATT TCAGAACTAA 780

TACCAGATTT TGATGTGGAA GAAACAGGAT CAACATTTGA AGAAAATGCT ATATTAAAAT 840

CAGAAGCTGC TGCAAAAGCA TTGAATAAAA CGGTCATAGC TGATGACAGT GGACTAGAAG 900
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GCGATGAAGC AAATATTGAA AAATTATTAA ATAAGCTTGG TAATACAACT GATCGTCGTG 1020 

CGCAATTtGT TTGTGTCATA AGTATGAGTG GCCCTGATAT GGAAACAAAA GTATTTAAAG 1080 

GTACTGTTTC AGGTGAAATT GCAGATGGAA AATATGGCGA AAATGGTTTC GOATATGATC 1140 

CGATATTTTA TGTACCGAAA TTAGATAAAA CCATGGCTCA ACTTTCAAAA GAACAAAAAG 1200 

GGCAAATTAG TCATAGACGA AATGCGATTA ATTTACTTCA AGCTTTTCTT GAAGGTGATA 1260 

AAAATGTCTA AATGGATTAT TGTGAGTGAT AACCATACTG AATCAGGCGT TTTATATCAA 1320 

ATTTATGAAA TGCACCCAGA TGCAGATGTA TATTTACATT TAGGA 1365 



(2) INFORMATION FOR SEQ ID NO: 396: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1383 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396:

AATTCCTGGT GCAATAATAA ATAGGATGAA AAAGATTCGG AAAATATGAT AACTCGTAAT 60

CATAGCAACA TCGGCACCAG TAGCTAATGC AACTAAAACT ATCTGATTAA CCCCTCCTGG 120

TGCTGCACCA AGaAACAATT CATTAATAGG ATTATTATCA AAGAAATGTA TGATATAAAC 180

CATGATTAGC GCACCAATTA TCAACATAAT ATTTTGAATT GTAATTGCGA TTGCTAGTCT 240

ACCTTTTAAA TCTGACAATA AATGCGCAAT TTGAACTCCA ATTCTAATCA TATATATTAG 300

TTGTGCCATG TTCAACAACC AATGATCTAG TGTAAATGTT AAACCTGTAG AAAAATTCCA 360

AACAATTAAT ACAATGAGTG GTGCTAATAA TTGAAATGTT GGAAACTTTA TTTTAGACAT 420

AATTAGATAA ACTATAAAGA TAGCTATCGC TAAAATAACT ATTTGCCCTA TGTTTAATAC 480

TTGTGATAAA GGCAAGACTT TTGTTAACTT TCCATTCGCA TGCATGTTAC CATCATGAAA 540

AAAATATGAA ATGAACGGTA CTAAAACAAC AACAAATATA ATTCGTGATG TTTGCGTTAA 600

GCTAACAACT AACAAATTAG CACGTTTGTC TTGTTCAGCC ATGACCAGCA TTTGTGTTAG 660

TGCTCCTGGT ATAACACTTA AAATAGCTGT TTCTGTATTA ATACGTGCAA TTTTTTTAAA 720

AACAAATGCC ATTACTATTG CAATTAATAA TATCGAAATA GATACAACAA TAATCGAAAG 780

CCAATTGTTT TTAATATCCA TAACGACATT TTTCGTAAAC GTTGATCCGA TTTGCACACC 840

TAATAGTACA ATACCTAATT CACTAAGTAA GAATGGCCAT TTAATATCAA GTTTGAAAAC 900

TTTTACACAA ATGATTGATG CGATAATAGG ACCAAACATA AATGGAAGTA ATACGTGCGA 960
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TATCATTGCC ATGTTTTCCA CTTCTTTCAA TAAAAAATAA AATGACTAAA TTGCTGCTTG 1060 
AGCTTCACGT TTGTTAAGAT AACAATATCC GCTAGCAGTT tTGACTACAA AGCATATATG 1140 
GaCTTTCACT ATCAAGTCGC CGCCCATGCC TTATATACAT TTAAAAnGAG CCTGAACAAA 1200

GTTCAGGCTC TCAATTTGTC OGTATATTTA TTTTACAATA CGACTTAAAG CCGTATCAAA 1260

TGCTTGAATC GTTTTTCAAT ATCTTCTTTC GTGTGTGCCG TAGATAAGAA TGTACCTTCA 1320

AATTGAGATG GnGGnAAAAA CACACCTCTT TGnCATTCTC GGTACATTTC TGCAATAATT 1380

TCC 1383 
(2) INFORMATION FOR SEQ ID NO: 397: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 415 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397: 

TCCACTAAnA TGATTACAAT TGCATTAGTT TGGTGGAGTG CATTTACAAT CTTAACGGGT 60

ATGATTAAGA ACCAOGGTTT AATTaTTTAG TGAGATTCTT ATTTGGTGTA GGTGAGGCGC 120 

CAATGTACCC TTCTAATGCT GTGTTTAATT CATTTTGGTT CTCTAAAAAT GAAAAAGGTA 180 

GAGCATCAAG TGCATTATTA GCAGGATCAT ATTTCGGACC TGTATTAGCA CCAATAGTTA 240

CAATTGCTAT TGTTAACGCA TTTAACTGGC AAGCAGTATT TTACATTTTT GGTGCAGTAG 300 

GTATTTTAAT GGCkGTATtA TGGGCGATTA TTGCCAAAGA CTTACCTGaG CrACATAGwa 360 

TGGTTAATGA AGCGGAGAAA CGTTTCATTA TGGAAAATCG TGATATCGTA GCTAC 415



(2) INFORMATION FOR SEQ ID NO: 398: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1141 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398:
TTTTAGaTaA aTyCAATTnT CyATaCTaAA TgATTnTCTT ATTaCGTCAA TTcGCCTTTT 60 
aTTTTATCGT AATCTTTCCa CTGCAAAGCT AAAGCTTCTC CTATTCTAAG ACCAGAATAA 120 

AATAACAGTC TAGTTAGCTG ACGAGAAGTA TCATTTGTGA TTTGTTCTAC TTTTTCATCA 180 
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5 



AATQTGGGGT CGTATAAGAG CTTGTAATGC TTTTTGGCGT AATTGATAAC TGCTTTAAAA 300

CCTGCCCACA CAGATCGTGC ATAGTCAACA GAAAGACCTG CATCGTTTAA fcAAATAATTC 360

CTOAAAGCAG TACATTGCGT AGTAGTGATT TTGCCAATAG GGATATTTCC GAACCTTTCT 420

TTTATGTGAG TATTATATTC TGTAGTTCGC TTTTCTATTG AGCGTGCAGA AAGATTTTCA 480

GATCAAAAAA TATATATTCA AAGGGTTGAT TGTCCGAGTA TCCATATTTA 540

agattttgta TAAATTCGCT TTCAGCTAGT TTGGCATCTT TCTTACGTTC AAACCCACGC 600

TTCATTTTTC GTTTGTTATT ACCGTATACA TCTTTATATC TAATGGAAAA ATACCATTTA 660

CCTGTATTAT CATCCTTATA TACTGGCATT TTGCTTCTCC CTCCTCAAAA TTGGCAAAAA 720

ATAATAAGGG TAGGCGGGCT ACCCGAAATT TAGTACTAGG TACTAAATGT GATATAATAA 780

AATAAAAAGT AGGTGATGTT ATGACATTTA AAAACAATCA TAATTTCAAT GAATTAGTTT 840

TAACGAATGA AGACATTAGA ATTTTAAAAA ATGTCTTAGA AGATGCAGTC AGTGTTTATG 900

ATGAATATTC GGTATGTAAT GAAGAATCCG ATTTTGCTTA CTGTTTATTA AGAOACTTAT 960

ATACATTAGA CAGCTTAGCT ATTTCGTCAA ATAATGTTTG AATTATCGAA TTGTACTCTT 1020

CGATTTTAAT ACCATGCATA ATAGAGTTTC TGTGTTCAAT AGCAGCTTTG ACTGAATGTk 1080

TTAAATGTTC TTCTATTAAA TCGTTGTTTk CCAtTTCGtk TAaAAATGTt CyTATATTCC 1140

T 1141

(2) INFORMATION FOR SEQ ID NO: 399:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 706 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399: 


ACTGTGGTAG GTTTTTTATT TTGAAGTATT AATCATAACA GACTAATAAT CATGAGGTAA 60 
CTAATAACAC ATATTTAACT TGTATTCTTA AACTGGTATA ATAAATTTAT GTTGAAATGA 120 
ATATTGTATG ACAGGGTATT CACTTTTATT AAAAGGTAAA ATTAAATAAA GGTTTTATAG 180 


AACGTATTTA AATATATGAG GAGTAAACAA ATGGCTGATA GAACGAATAA AGAAATTAAA 240 
ACAGGACGCT TTATTGCAAC TGCATCAATC GTATTCTCAA TATTATTGAT TATTCATTAC 300 
TTTGTTTCGT TGGATAATGC GACTGCCAAA GCATTACTTA ATTTAACGAA TCAAAACACT 360 


TCAGATAAAG CGATTGATTA CATTTTAAAC AGCTTTAGAT TCACTGGTAT TATGTATATT 420 
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ATGTTTGCAG TTTATGTATC AAATAGTTTG TTTACGTTGA TTAATTTATC AATCACAATT 540 

CAAGCAATAA AAGCTGCACA CGGTGCGTAC TTAACATTGC CAATTTTAAT TGTTATTATA 600 

GGTTCGGTTG CATTAGCGAT TTATATGCTT GTTGTTTCTA TCAAACGTAA AAGTACATTT 660 

AATCGCTAGA AAATTGATTT TAACAATAAA AATATGAAAA AAAAnn 706



(2) INFORMATION FOR SEQ ID NO: 400:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1187 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400:



ACACAATCTG AAGATTCACG TTGTGGTGCT GGACATGATC GAAAAATTAG AGCTGAACAA 60

ATGAnAOAAA TCAGTGATTT TGTTAAAAAG AAAAATATCC CTAAAGATGA AACGGTATAT 120

ATAGGTGGCG ACCTTAATGT CAATAAAGGC ACTCCAGAGT TCAAAGATAT GCTTAArAAC 180

TTGAATGTAA ATGATGTTCT ATATGCAGGT CATAATAGCA CATGGGACCC TCAATCAAAT 240

TCAATTGCGA AATATAATTA CCCTAATGGT AAACCAGAAC ATTTAGACTA TATATTTACA 300

GATAAAGATC ATAAACAACC AAAACAATTA GTCAATGAAG TTGTGACTGA AAAACCTAAG 360

CCATGGGATG TATATGCGTT CCCATATTAy aCGTTTACAA TGATTTTTCA GATCATTACC 420

CAATCAAAGC CTATAGTAAA TAGTGCTCAA CTAACTAATA ACTTGCTtCG TTCTAAAAGG 480

ACGAAGCGAG TTATATTGTT AAAATTTGAA TTGACTTACA TTTTAATAAA ATCATCTTAA 540

CAACTTTAAT TTTTCaTTAA TACAaGTCTT TACTCTACAC TCAAACnAGA TTCATACACT 600

GCACGTCATA ATAAATCTAT CTATTCAAAT ATAAATAAAA GTTACCTACT ACATTCTATG 660

TAGCAGGCAA CTTTTATTAC TTATTTCTTT TCATTATCAT TAAGTACTTT TACAAACTTC 720

ACATTATGTG TCTTCCAATC AACTTCATAT AATGCTGATA ATTTTTCTTC TTTTTTATCT 780

ACATGGTTTT CACCAGACCA ATAGCCCCAG AAACCATGGC GATTCCAATC TATTTTAAAC 840

TCATCCATTG ATCTTTTATA ATGAACAACA AATTGTGATT TACCTTTGTC TTTTTTATCA 900

TGTGACATAA CAGCTAAAAA TTCTGGATTA AACCCTTCAG ACACAGTTAC AGGCATTTTG 960

TCTTTAGGTG TGAAATTATC TTTCGCCCAT AAATTTCCAT TTCGTGTTAA AGAAAAAATT 1020

TCACTTTTAG TTCTATTATC ACTATCATTA GTTAATTGTC TCGTATGGTC ATGTCCCATA 1080

TTATTTATCA AATGTGCTTC TACTTTCCAA CCTACACCTT TATGTGACGT AGATTGATCA 1140






EP0 786 519 A2 



(2) INFORMATION FOR SEQ ID NO: 401: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 847 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401: 

CCAGAATTAT TTTTTCAAAA AGGACAATTT AACAATGTCG ATAACGTTAT CATAAGCAAA 60 

CCGATGAAAG GGACAATGCC TAGAGGTAAA ACGGAaGCTG AAGATCAACA GTATTATAAA 120 

ACATTGCAAA CTTCTTCGAA AGATCGTGCA GAAAATGTCA TGATTGTTGA TTTACTAAGA 180 

AACGATATAG GGAGAATATC ACAGAGTGGC TCAATTAAGG TGTATAAACT ATTTTTTATT 240 

GAGGCATATA AAACTGTATT TCAAATGACT TCGATGGTAA GTGGAACTTT AAAAAATAAT 300

ACAGACTTAA CTCAAATTTT AACATCGTTA TTTCCTTGTG GTTCGATTAC AGGTGCACCG 360 

AAACTGAATA CAATGAAATA TATTAAACAA TTAGAAAGTT CACCTCGTGG TATATACTGC 420 

GGACAATTGG ACTATTACTT CCAACTGAAG ATGATAAAAT GATTTTTAAT ATTCCGATTC 480

GCACTATTGA GTATAAATAT GGACAAGCGA TTTATGGAGT CGGAGCAGGT ATTACAATTG 540

ATTCTAAGCC AAAAGATGAA GTGAATGAAT TTTACGCAAA AACCAAGATT TTGGAGATGT 600

TATAATGCAA TTATTTGAAA CAATGAAAAT TGATAATGGA CATATCCCTA GACTTACTTA 660

TCATACTAAT CGCATAAAAT GTTCTTCTGa GCGATTAAAC TTTAAATTTG ATGAACATGC 720

ATGGCGAAAT GAATTAAACG ATGTAACAAC AAAGTATCAC AGTGGTCAAT ATAGACTTAA 780

AATCGTATTA AATGCTGAAA GCAAATTTGA AACGATAGTG TCACCTTTAC CTGAGAAAAG 840

TAGTTTT 847 
(2) INFORMATION FOR SEQ ID NO: 402:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 740 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402: 

TGAAGATGAA GCAGAAGCAG AAGACAATCT GCTACGAGTA CAATCGAAGA AAGAAGAAGT 60
GTATCGTCGA TTACTTGCTT CTAACTTAAC AAGCGTTCCT GAAAGtTTAT CATCATGAAA 120


GTTAAACAGT TAAAAGATAA AGTGTCTAAA ATTGTGATTC AAATGAATAC ATTTGAAGAT 240 

GAAGCAAATG ATGTTCTTGT TAATGCTGTT TATGCAGAGA AATTAATTCA ATATGGAAAT 300 

AGATATCGTA AGGACTATAG CAATGTTGAT AAGAGCTTAA ATGAAGCTGA ACGATTATTT 360

AAAAATAATC GCTATAAGCg TGCGATTGAA ATTGCAGAGC AAGCTCTTGA AAGTGTTGAG 420 

CCAGGTGTTA CTAAACATAT TGAAGAAGAA GTTATTAAGC AATAGAAACT AGTATGTAGT 480 


TATACTTAAA TAATATGAGC ACTCTGTCAA ATTGGACTGA TGAGTTTAAT AATTGAAGTT 540 

AGCCAACGAT ACGTTGTCTA GCTTCTTTTT TATATGGATA AATGaAAGGG ACAATAAATA 600 

TAAATAGCAA TTGTTTAAAG ATAAACOTAA TCAAATGTGT TGTTTTAATT AATATAAGTA 660

GTGAAAAAAG CATAATCACA CAGCTGTTTA AATAGAGTGA AATAGTCTAA TTCTTATTTA 720 

ATAAGTAGAA ATAAGATTAT 740 

(2) INFORMATION FOR SEQ ID NO: 403:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 630 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403:

ATGCCAATAA TTAAACCATG TAATAAATAT ACGTATAGCG TACGACTACC AATATAAGTA 60

TATAATTTTT TCTTTGTTGA CATTAAATTT AGAAACGCAG TCATTGCGAT TAATATAATT 120

CCATATAATA TAAGTCGTTT AAAAGGACTG AATATACTCT GTCCTTCATT TTCAAGTGAA 180

GTATATGGTG AACTTCCCAA TAACCAATCT GCATTGATAG GATGAATCAC GTAAACGATA 240

AAAAACAAAA TAAAGGTAAT GATAGATACT GGTATTAGTT TTTTATTTTT AAAAATAGCC 300

GTATGTTTTT TGGTGAAAAT GTAACCTAGA TAAAATATTG GGAAAAATAC GATTGTCCTT 360

GAAATGCTTA AGTAGCTATC GATGTTATCT GAAAAACCTG CTCCAATAGA TATAATAATT 420

GAAACTGATA GCACTTTATA TGGATTAAAT CTTCTAACTA TTACTAAAAT GACATGAAAG 480

AAAAATAGCG TGATCAAAAA CCATAACGCA AATACTGGGT TAAAAGGATC AAGTTGTAAT 540

TCGTCACTTT TACCTGTTAA GAAATAATAA ATTGAAAAGA ATGCAAAAAA TATCATATAA 600

GGTACTATCA AACGTTTTGA AATTTTTTCT 630

(2) INFORMATION FOR SEQ ID NO: 404:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6254 base pairs
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404: 

ATTTAAACGT TCAGTTTCTA AAAGTGGAAC AATCCAAGAA GTACGTAAAC OTGAATTTTA 60 

CGAAAAACCA AGCGTAAAAC GTAAAAAGAA ATCAGAAGCT GCACGTAAAC GTAAATTCAA 120 

ATAATTAATA CCTCTGTTGA CTCCCTCAAC ACGAATATTA ATTATATAAA ACAAACATCA 180 

CAAGTTAGTG TCTGACACTA ATATGTGATG TTTTTTTGTT GTCAATTTTT AATTAAAAAA 240

AGTTATATAO TTTATAAATA ATCAAATTGA TATTCTATAG GTTCTTATAA CTATAAAGTA 300

TATTCAATTT CATGTATAAT TAATGTGAGG GCGAGGTOAA ATTGTOAGTT ATAATAATTT 360

TTTACAAATG ACAACTATCT TGGAATCAAC GGCTGGAGAT ACTTGGGTTG AACAAGTTAG 420

CAATATAATT GTTCAACCTA TTTTTACGTT AATATTAACC TGTTTGACAT TCTTAGGATT 480 

TGTATATCAA CTTTACTCTA AAAAAATCAA TGCAGCTGGT ATTATCGCCA CATTATCATT 540 

ACTTATTTTA TTTTTGGGAT TTCTAATCCA AGGAAATGTC AATATGCATT CTATCTTAAT 600 


ATTCTCAATT GGCGTTATAT TAGTTGTAAT TGAATTATTT GTAGTTGGTG CAGTAATTGG 660 

TATTATTGGC ATGATACTGA TAACTATAAG CATAACAACG CTCGGTGATA ATTTGCTATT 720 

TATGCTTGCG AATGTTATCG TTGCCTTGAT TTTAACGATT GTAGAATGGG TGATATTAGT 780

GAAGATTTTC AACAGAAAGA TTCCGTTTTT GGATAAAGTT ATCTTAAAAG ATTCAACTAA 840 

TTCTGAGTCA GGTTACAATT CTCATGATAA CCGCTCGCAC CTCGTAGGAA AGACTGCTCA 900 

AACAGTTACA GATCTTCGAC CTGCAGGGAT TATTTTTTGT GAAAATGAAC GTATTGATGC 960

TGTTTCAGAT GGCAACTTTA TTTTGCGCAA TAAAACGGTA AAAATCCTTG AAGTTGAAGG 1020 

AACAAGAGTA GTTGTGAGGG AAGTAGATTA ATTAAAAGGA GCGATACCAT GTTTAGTTTA 1080 

AGTTTTATCG TAATAGCAGT TATTATAGTA GTTGCATTAC TTATTTTATT CTCATTTGTA 1140

CCCATTGGTT TATGGATTTC AGCGTTAgcA GCTGGCGTTC ATGTTGGTAT AGGTACATTG 1200 

GTTGGTATGC GTTTACGTCG TGTATCTCCA AGAAAAGTTA TAGCGCCATT AATTAAAGCG 1260 

CACAAAGCAG GACTAGCATT AACAACAAAC CAATTAGAAT CGCATTATCT AGCAGGAGGA 1320 

AATGTTGACA GAGTTGTTGA CGCTAATATT GCTGCACAAC GTgcTGACAT TGATCTTCCT 1380

TTCGAACGTG CTGCTGCAAT TGaCCTTGCA GGACGTGACG TATTAGAAGC GGTTCAAATG 1440

TCTGTTAATC CTAAAGTCAT TGAAACACCA TTTATCGCAG GTGTAGCAAT GaACGGTATT 1500 

GAAGTGAAAG CCAAAGCTCG TATCACAGTT AGAGCTAATA TTGCTCGACT TGTTGGTGGT 1560 
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10 






AGTAAGCATC ATACAGAAGT ACTTGAAAAC CCAGATAATA TTTCTAAAAC AGTTTTAAGC 1680 

AAAGGTTTAG ATTCAGGTAC TGCATTTGAA ATTTTATCAA TTGATATTGC TGACGTTGAT 1740 

ATTAGTAAAA ATATTGGTGC AGACTTACaA ACTGAACAAG CATTAGCAGA CAAAAATATT 1800 

GCACAAGCAA AAGCTOAAGA ACGTAGAGCT ATGGCTGTAG CAACTGAGCA AGAAATGAAA 1860

GCGCGTGTAC AAGAAATGCA TGCTAAAGTA GTTGAAGCCG AATCTGAAGT ACCATTAGCT 1920 

ATGGCTGAAG CATTACGTTC AGGTAATATC AGTGTTAAAG ATTATTATAA TTTGAAAAAT 1980 

ATCGAAGCTG ATACAGGCAT GAGAAATGCA ATTAATAAAC GAACTGATCA AAGTGATGAT 2040 

GAGTCACCTG AACATTAAGT CGAGAGGTGA TTAAATGAGT GTCGGTATTC TAATTTTTGT 2100

CATATCAGTG ATCATTTCTA TCATTACTAC TATCCGCGAA AATAGTCATA AAGATAGACA 2160 

AAATCAAAAG CCACCTCAAA AAACATCTAC CGATAATGAA CCAAAAAAAG GTGGCTTTTT 2220 

TGAAGAAATT GAGCGAACGT TTAAAGAAAT AAGTGAAGAA TTAAATGAAG AAGAAAAGAA 2280

ATCATCGAAA COAAAATATG ATGATACGTT ACCACCTTTA TTCGATGAAC TTCCAAAGGA 2340 

AGAGCCTAAA TCGAAACCTG TTGTAGAACC TATGGCACCT AAAAAACAAC AAGAAACAAA 2400

ACCGATGACA GAGAAACCAA TCACAGTGCC TAAAGCAGAA CCGGTGGAGC AGAAACATAG 2460 

ACCTTCTAGA CAAGATAATT CTGACGAAAT TAGACGTCAA TTAGAAAAAT CACTTAGAGA 2520 

TGATATTAAA ACGATTCGTA CTGACATTGA TAGAGAAAAA GAAAAGCAAA TTGCTAAAAT 2580 

GGAAAAACGT GCTAGAGATA TTATTGAGGA TAAATACTTA TCTGAACGTA CAAAACGTTT 2640 

GAAATTAAAG CAGCTGCTTA ATTCTCAAAA TGTCGAAAAA GATTTGACTA AATCAGCGTT 2700 

CCAATTTGAT AAAGATGAAG TAATCAATGG TATGATATGG TCAGAAATTT TAGCTAAACC 2760

AAAACAATTA TAAAATTTTT TGAAAACAAG CACTATCGTA ATGGTAGTTG CTTGTTTTTT 2820 

TACGTTAAGG AAAATTAAAA AACAAAGAGA ATTTTTCGAG AAATATTAGT TATTTAAATT 2880 

ACAGCAAAAA ATTGATTAGT CTAAAATTGA ATCTGCTTTT ATGACAAGGT GAAAAGTATA 2940

AATGATTATT TTAAATTAAA GAAAAATGAG TAAGTCAATG CAAAGATGTT TAAATCAATC 3000 

AATTGCATGA TATAATTAAG TAGATATTAA AGCATCATAG AATGAATATA AATGATATAT 3060 

GAAAAGGAGC GCGTGTATGC CTGGAATTAT ACAAATAGAC GATATGAACC AATCTCAAGC 3120 

TTTAATTGGA AATAATGATG AACATTTAAA AGCAATTGAA GAGAGTTTCG ATGTTGTCAT 3180 

CCATGCAAGA GGACAAGaAG TTGCCGTTAA AGGTACAAAA ATAGAAAACG TAGAAAAAGC 3240 

GGAATCAGTA TTAATCAATT TGCTGAAGGT TATTGATTTA GGTAATAATA TTACAATTAA 3300 

AGATGTTGAA GCAGCTATTA AAATGGCGCA TAATAACACA ATTCAACATC TGTTAGATTT 3360
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45 
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GCAACGTATA TATGTTAATG CCATGAAAAA 
TAATGATTTA GTATTTGGTA TAGGTCCTGC 3480

TGGTACAGGT AAGACATTCT TAGCTGTAOT TTATGCAGCA AAGCAACTCC GTAAAGGTGC 3540

TGTTAAACGT ATTGTATTAA CAAGACCTGC TGTTGAAGCA GGAGAGTCAC TTGGATTTTT 3600

ACCAGGAGAT TTGAAAGAAA AGGTAGATCC ATATTTAAGA CCTTTATATG ATGGTCTATA 3660

TACTGTTCTT GGGCGTGAAC AAACAGAGCG ATTTATTGAA AGAgGCATTA TCGAAATAGC 3720

GCCACTTGCA TATATGCGCG GACGAACATT AGAAGATGCA TTTGTAATTC TTGATGAGGC 3780

GCAGAATACG ACACATGCGC AAATGAAAAT GTTTTTAACA AGACTAGGTT TTGGCTCAAA 3840

AATGGTAGTT ACTGGTGACC AAACTCAAAT CGATTTACCT AAAGGTGTTA AAAGTGGACT 3900

TAAGOAAGCG GTCAGTAGGT TACACAACGT TAAAGGTATA AGTATATTGA AATTAGATCA 3960

GAGCGATGTA GTCAgACATC CATTGGTAAG TAAGATCATT GAACATTATG AAGGAGAGAA 4020

TTAAATGTTT ACGATAGATT TTAGCGATCA CACAGGCTTA GTTAAAGATG CTTGGTATAA 4080

ACAAATTGAA GATTTATTAG AATTTGCTAA AAAAGAAGAG CATATAGAAG ACGATGCTGA 4140

GCTTTCTGTT ACATTTGTAG ATAAACAAGA AATACAAGAA ATTAATCGAA CATATAGAGA 4200

TAArGwTAAr GTTmCaGATG tAaTCyCaTT tGCTTTAGrA GrAGATGAGC CmGaGATkGA 4260

TtTTAGTGGT CTTGATATAC CACGTGTTTT AGGGGATATA ATTATCTGtA CGGATGTAGC 4320

GCAAGrACAA GCAAACAATT ACGGACATTC TTTTGAACGA GAATTAGGAT TTTTAGCATT 4380

ACATGGATTT TTGCATCTAT TAGGTTATGA TCATATGACT GAAGCGGATG AAAAGGAAAT 4440

GTTTGGTCGA CAAGATACAA TATTAAACGC ATATGGATTA ACACGAGACT AATTATGAAA 4500

AGGTTTAAAT ATGCACTTGA TGGGCTGAAA ATCTTAATTC AAAAAGACTA TAAATTTCTT 4560

TTACATGTGT TTGCAATGAT TGTTGCTATT GTCTTTGGTC TCGTACTAAA TATTAATCGG 4620

ATTGAGTGGA TATTTATACT CATTGCTATT GCATTAGTTC TCACTGTTGA AGCTTTAAAC 4680

ACTGCTATTG AATATGTTGT CGATTTAGTG ACCGTTGAAT ATCATGATTT AGCTAAATAC 4740

GCTAAAGATA TTGCGGCTTT TAGTGTACTT ATAGTTTCAA TATTAGCATT TATTATAGGT 4800

TTAATAGTAT TTTTACCACA TTTTATAGCG TTATTTTAGG GAGGCATATA TGAGTTATCA 4860

ACCTCATTAT TTTCAAGAAG TTAGAAAAGC ACAACAAGAA TCATATTCGC CATACAGTCA 4920

ATTTAAAGTA GGGGCTTATT TAAAAmCGAA AGACgGTAGA ACTTTTTATG GTACCAATGT 4980

AGAAAATGCT TCTTATCCAT TATCGATATG TGCTGAACGA GCTAGTTTGG TATCGGCAAT 5040

TTCTCAAGGA TACAGACCAG GTGATTTTGA ATCAAtAACT GTAACCX3TAG ATGCAGATAA 5100

ACCGTCATCA CCTTGTGGTG CATGTCGTCA AGTTTTGAAG GAATTATGTG ATGATGATAT 5160
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ACCATTTGGA TTTTCAGGAA AGGATTTAGA ATAAATGACA GAACATAAAT CAGGATTTGT 5280 

TTCAATTATA GGTAGACCAA ATGTAGGAAA GTCAACATTT GTTAATAGAG TGATCGGCCA 5340 

TAAAATAGCA ATCATCTCCG ATAAAGCTCA AACAACTAGA AATAAAATTC AAGGTGTTAT 5400 

GACAAGAGAT GAGGCGCAAA TTATATTCAT TGATAOGCCA GGTATTCATA AACCTAAACA 5460

CAAATTAGGT GACTATATGA TGAAAGTCGC TAAAAATACA TTATCTGAGA TAGATGCAAT 5520 

CATGTTTATG GTTAATGCCA ATGAGGAmAT TGGACGAGGC GATGAATATA TTATAGAAAT 5580

GTTGAAAAAT GTTAAGACAC CAGTATTTTT AGTATTAAAT AAAATAGATT TAGTGCATCC 5640 

AGATGAATTA ATGCCAAAGA TTGAAGAATA TCAAAGTTAT ATGGACTTTA CAGAGATTGT 5700 

ACCTATTTCA GCATTAGAAG GGCTAAATGT CGATCATTTT ATTGATGTTT TAAAGACGTA 5760 

TTTACCCGAA GnACCTAAAT ATTATCCAGA TGATCAAATT TCAGACCATC CTGAACAATT 5820

TGTAGTGGGT GAAATCATTC GTGAAAAAAT CCTTCATCTT ACAAGTGAAG AAATCCCTCA 5880

TGCGATTGGT GTTAATGTGG ACCGTATGGT TAAAGAAAGC GAAGATCGTG TTCATATCGA 5940

AGCAACTATA TATGTTGAAA GAsGTTCGCA AAAAGGAATT GTCATTGGAA AAGGCGGTAA 6000 

AAAGTTAAAA GAAGTAGGaA AAcGTGCGAG ACGTGAtATA GaAATGCtTC TAGGCTCTAA 6060

AGTTTACTTA GAATTATGGG TCAAAGTTCA AAGAGACTGG CGAAACAAAG TTAACTTTAT 6120 

TCGCCAAATT GGTTATGTTG AAGACCAAGA TTAATCTTAA AAGTGGTGAA GATAATTGTT 6180 


AATGCGCCAA AAAGGGATTA TCATCAAAGC AGTTGATTAT GGTGAATCTG ATAAAATTAT 6240 

CACGATTTTA AATG 6254 
(2) INFORMATION FOR SEQ ID NO: 405: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3710 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405: 

GTTGTTCTAA ATGTTTCTTG nATGAAACGA GTCAATGTAA ACTGATATTG CTGTATTTGT 60 

TGCAGCAATT CATATTGGTC TGGTGTTGCA ATAACAGCAG CTTGAGTTGG AGTCGCAGCT 120

CTGATGTCTG CAGCAAAATC ACTTAATGTA AAGTCTGTTT CATGACCAAC TGCTGATATA 180

ATCGGTGTCT TACAATTATA TATTGCACGG ACGACAGCTT CTTCGTTGAA ATTCCATAAA 240

TCTTCTATGG ATCCACCGCC TCGACCTACA ATAATGGTAT CTACACCTAA ACTATCTGCA 300 
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20 



TGTATTTGTT CAGCTAATGG AAAACGACTA TTTATCGTTG AATGGATATC TCGAATTGCG 420 

GCACCTGTAC TCGCTGTTAA AACTGCAATT TTTTTAGGAA ACTTAGGTAT TGATTTCTTA 480

TTCGCTTTAT CAAAACAACC TTCTTCAGTT AATTTTTTCT TTAATGCTTC TAATTTTTGA 540

TATAAGTTCC CTATACCATC TAATTGCATT TTATTTACAT AAATTTGATA GTTTCCACGA 600 

CGTTCAAAAA CAGAAACACG TGCTTCTAAT AAGACTTCAT CTCCTTCTTT AGGTTCGAAG 660

TTTAATTTAG AAGCACTACC TTTGAACATC ATGGCACTTA TAACGCTTTC TTTATCTTTC 720 

ACATTAAAGT ATAAATGaCC ACTTGaATGc TTTTTGAAAT TTGAAAGCTC ACCTTTAATC 780

AATACAGATT GGAGATGTGG ATCTTGATCA AATTTATATT TAATATATTT CGTTAAAGCT 840

GAAACACTTA AATAATCTGA CATATAACAT CACTCAATTT TATTTTTTTA TATTACTCAA 900

TACACCATTT ATAAATTTAT AATGATCATC ATCACTGAAT TGTTTTGTTA ATTCAACTGC 960 

TTCATTCATT ACGACTTTAG CAGGTGTATC ACTGTGTAAT ATTTCATATG TTGCCATTCT 1020 

TAAAATAATA CGATCCGTTT TTAATAAACG TGCAATAGTC CAATCTTTTA AATAAGGACT 1080 

AATTGTCTCG TCTAATACAG GTTCGTGATC TTTAACGCCA GAAACTAGCC AATGAATAAA 1140 

TTCGAAGTCT AAATCTGGAT TATCGTCTTT AATAAAGCTT ATCGCTTCAT TTATCGTTAA 1200 

ATCACTGTCC TTCATTTCTA ATTGAAATAA AGTTTGAAAA GCTTGCACTC GGGATTCTTT 1260

ACGACTCATT TTTAACTCCT TCAAACGTTT GTATTTTTCT TTATTTAATT ACTGAATTAG 1320

GTATGACATT ACTTTTCAAT AACGATTTGT GTAATGTGAA TATTAATTTG CTTAGGTTCT 1380

ATCGCTGTCA TATTAGAAAT TGAATTAAAA ATTGACGTTT GAATTTTGTT TGCAGTTTTT 1440 

GAAATATTAA CACCATGTTT TAATGCACAA TATACATCTA TATATATGCC ATCTTCTTTA 1500

CTCTCGATTT TTAAATCACG GCTTAAATTT TTACGACTAA CTTTTTCTAA ATTTGTTTCT 1560 

TTTAATTCAG CAAAATGGCC AGTGATGCCT TCGACTTCCG AAGTAGCTAT ACTTGCAATA 1620 

ACAGATAGCA CTTCTGGCGC TATTTCTACT TTACCTAATT TTGAATTTGA ATAATCAGTT 1680

ACTTTGACCA TGGATTGACC TCCTATTAAC CTTCATCATT CATAATGCTA TTTTGCTCTA 1740

AAAAGTTTGT ATTAAATTTA CCGCTTCTAA ATATATCGTT ATTCAATAAT TTAATATGGA 1800 


ATGGAATAGT TGTATCAATA CCAAGAACCA CAAATTCACT TAGTGCACGA ATGCCAGCCA 1860 

TAATCGCTTC ATCTCGTGTC GGTTCATGTA TGATTAATTT CGCTACCATC GAATCATAAT 1920 

ATGGCGGTAT CGTATAATTA GTATAACATG CTGACTCTAT TCGAACACCA TATCCACCTG 1980

GTGCAAGATA TTGCTCmATT TTACCTGGTG ATGGCATAAA GTTCTTGTAA GGATTTTCAG 2040 

CATTAATTCT AAATTCAATT GCGTGTCCTG TTAATTTAAT ATCTTCTTGT TTATACGGTA 2100 

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CAGTTACAGG ATGTTCTACT TGAATACGTG TATTCATTTC CATAAAATAA AATTTATTAT 2220 

CATTTAAATC ATATATAAAC TCAATTGTTC CCGCATTTTC ATAATTTACA GCTTTCGCTG 2280 

CACGAACTGC GGCATTTCCC ATTTCACGAC GTGTTTCATC ATCTAAAATT GGGGAAGGTG 2340 

CTTCTTCCAC TAATTTCTGC ATACGTCTTT GAATTGTACA ATCACGTTCT CCTAAATGAA 2400 

TTACATTACC ATAGCTGTCC CCAACAATTT GGATTTCAAT ATGGCGGAAG TTTTCGATGA 2460 

ATTTCTCCAT ATAAAGTCCA CCATTACCAA ATGCAGTTTG AGCTTCTTGT TCTGTCATTC 2520 

GGAAGCCAGT TTCAAGTTCT TTTTCATCAC GAGCAACACG GATACCTTTT CCGCCACCGC 2580 

CAGCAGTAGC TTTAATGATG ACCGGATAGC CAATTTTTTT GGCGATTTTC TTAGCTTCTG 2640

AGAOGTCTTT CATTAAACCG TCACTACCAG GAACAACTGG AACATTGGCT TTGATCATTT 2700 

CTGCCTTAGC AACATCTTTG ATACCCATTT TTTGGATAGA TTGATAACTT GGTCCAATGA 2760 

ACTTCAATTG GcATgctTCG CATAATTCTG CAAAATCAGC ATTTTCAGCT AAAAAGCCAT 2820

AACCCGGATG AACGCCATCA CAACCTGTAG AAGTTGCAAT AGATAAGATG TTCGGAATAT 2880 

TTAAATATGA ATCTTTAGAC AAAGTGGGAC CTAOGCAATA TGCTTCATCA GCAATTTGAG 2940

TATGTAGCGC ATCTTTATCC CCTTCAGAAT AGATTGCAAC AGTTTGGATG CCTAAATCAC 3000 

GACAAGCGCG AATAATCCTA ACTGCGATTT CACCGCGGTT TGCAATTAAA ACCTTTTTCA 3060 

TTATTTCACC TTAAATAACG GTTGGCCATA CTCTACCATT TGTCCGTCTT CTACTAAGAT 3120 

TTCAACAATT TCACCTGAAA TTTCTGCTTG AATTTCATTA AATAGTTTCA TTGCCTCTAA 3180

AATACACACT GTTGTTTCAT TTGAAACAGT GTCCCCAACT TGCACATATG CTTCTTCGTC 3240 

TGGAGATGGC GATTTGTAAA ATGTACCTAC CATAGGTGCA TTAATTGTTT TGTGATTATC 3300

TGAAGTTGGC TTTGGAGCTT CAGTTTTATT GCTATCAGTT GATTGTGCTT GAGGCATAGG 3360 

CATTGCCGCA GCTTCAACTG GCATTTGTGA GATTTGTGGC GTGATAATCT CAGTTTCTTT 3420 

TTCTTTCTTA AGCGTCACTT TGCCTTTAGT ATCTTCAATA TTGATTTCCG TTAAAGTTGA 3480 

TTTATCCAGA ATTTCAATTA ATTCTTTGAT TTCTTTAAAG TTCATTATTA CTGACTCCTT 3540 

CAGTTTGTTT TCATCTACCC GTCTATTTTA CTTGAGACAA CTCTTCAATT CAAGCATGTT 3600 

CATATTGCTG GCGACATTAT AAGTCTATCC CAAAGTTATA ATAAAACCAC ATTTTAAATT 3660

AAAAACACTT GTGTATTTAT TACTTAACAT TGACTCATCT TAACTCTTGA 3710 
(2) INFORMATION FOR SEQ ID NO: 406: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1705 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406: 



GCTOATQTTC GTTOCCTTTn TCCACCAGAC AATTCAGAGG GATATTTATC ACTAATATCC 60

AATATATTTA ATGCTTCTGC TACTTTTTCA TAACGATTTA ACATATGTTC TTTATCTAAC 120

TTCTGTACCG TTAGTGGTAA CATTATGTTIJ TCTTTAACAG TCAATGTATG CAGTAAATTA 180

TACTCTTGAA AAATAAAACC AATATCATGC TTGCGTATAT CAGATwATTC CTTGTTTGAA 240

AGCTTTTCTA A1"1TTTTTCC TTTTAATGTA ATAGAACCTT GTGAAATATA ATCAATTGAA 300

CTTAAAACAT TTAATAATGT CGTTTTCCCA GATCCAGAGG GACCCATAAT AGCAATAAAC 360

TCGCCTTCTT CAATAGACAT ATTGATATCT CGCAACACTT CTTGTGCCAT rrrriTAGTT 420

CCATATATTT TTGTTAATTG TTTTACTTCT AAAATTGCCA CTTTAACACT CCTATAATTT 480

ATCTTAACTT CATTTCCTTT AGGCTTTGGC ACTTGTATCT TCAATTTAAC ATATGACTAA 540

CATCTATCTT ATTATAACGT TGAAGCTGCA TTGATGTATC AATTCTAAGT AACAAAACGC 600

ATGTTTAAAA TGACAAATTT OTCACTTCCG ACATGCGTTC AACAATTTCA TTTTGTAATG 660

GGAAAATCAA TCTGACAGTT GTCCCCTTAC CAACAGTCGA CGTGACTTGC AGGTGAATAC 720

CTAATTGATC CTTTACACTA TTTACTAAAT ATAGACCCAT ACCTGAAGAC GTCGTTTCAT 780

TTCTGTTAGC CGTTGACGTA AATCCTCGTT CAAATATTCG CGGCATATCT TTTTTACTAA 840

TACCTCTGCC ATAGTCTTTA ATATATAACG AAACATGTTG ATCATTTAAT TCTGTCCCAA 900

TTTCAATATT AAAATTCTCA CTATATTTCA ATGCGTTTGA CAAAATTTGT CTAATAATCA 960

TACGACACCA TTTTATATCT GTATAAACAT AATCATCCAC TTTAAAGTCA ACATCAAAAC 1020

CAATACCTTT AACCTGACTA ATATGTCTTG TTAATTGTAT TTCATCAATG ACCATGCGTT 1080

TAAGTGACAC GTAATCAAAA TACATATCTT TACGTTGAGA TTCTAATCTA GTAATATACA 1140

GCTGTGTATC TAGCATCGAG TTTATACGAG ACCATTCATA TAGTAATGCT TGTwTTCTTT 1200

CTTGATTTTT TTCTTGATCA ATTAATAATT TCATAGCTGT CACAGGtGTT TTTATGTCGT 1260

GCACAAATTC TGTAATGGTT TGTTCATGCA TGTTCAATTG CAACTGTTGC TCAACAACCT 1320

TTTCTTTGTG CGCTGAGATT TGACGATATA AATAAfCAAC TGTATGACGT TGAAATGGCG 1380

TTTCCGCTAA ATCTTTATGT TTAATTTCTT CTATTTCTTT ATCTTTGTCA AAATGCTTAT 1440

ATAATTTTAC TTCTTTAAAA TATGTCAATA AAAGAAAAAT CATTGTTAAA CTTAAATTCA 1500

AAGAAACAAT ATAAAATAAA CTGTCTATTG GAAAATCATA ATCGATTAGA CTArTGCCTA 1560

ACATAAGGAA GTTTAAAAAC AATATCCAAA AtATCCAGTT CATGCGAGAT TTCAAAAAAT 1620
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GCACTATCCA TACTAATTTC AGATA 
(2) INFORMATION FOR SEQ ID NO: 407: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1722 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407: 



TCAATATATC TTAAAATTCA ATGATTAAAT CnATTATCAC TAGACATnAA ATACATAAAT 60

CCTATTCCCC ATTTTCATTT nTTAATTCAT AAATGAATCA ATAACCACCT AATAACAAAT 120

CATATTATAC ACCTTTGTTC TCTATTTTTC TAAGGTTTAA AAAATATTTT TAGGTAAACC 180

TAAAAATAGA TGTAATAAAA ACGCCTCCTC AGATATTTAT ATATCTATGA AGACGTTTAA 240

ATACATTATA GATGGTCTGG TTCTGGGTGA ACGTATACTG AGGAAAtACC TTTTTTGTGC 300

AAATGATGTT CGACATTGTC ACAAATTTGA TGCGCTTCTA CTAAGGAAAG GTTAGCATCT 360

ACAACAATTG TGACATCAAT AAACACACTA CTTCCATGGT AACGCCCTTT AATACTTTTA 420

ACTTCTTGTA CTTCATCAAC TTCTAAAATA TCATTGCGAT ACGCTTCTAA TTCAGTTTCA 480

TTGAAACCAT CACTCAACAT AAAAATTGCT TCTTTAAAAA TACCAAAACC AGTATAAACG 540

ATTAGTAAGC CTAGTAATGT TGCTAAAATA ATATCGACAA TTGGGAAACC GATTTGCGTA 600

AAAATTAATC CTATCGCTGT TCCAATGCTG ACTAAACTAT CCGATAAATT ATCTTTGGCA 660

GCCGAATTTA AAGAACTACT TTTCGTTCTT TTCGCTAGTC TTTGATTGAC TGCAAATACA 720

ATCAACATTA CAAGACCACT GATTAAGCTG ACGATAATTG TTATTGCGTT AGGTACAACG 780

TCATCTTCTT TGAACAAACG AGGTGCATTT TGAATAACTA CTTGGATACC TACAAACATA 840

ATGACAAATG ACACCAATAA TGAAGAAATA TTTTCAGACT TCAAATGGCC ATAAGGATGA 900

TTTCGATCGG CAGGTTTAAT TGAAATTTTC AATCCAATAA TAACAGCTAA AGAAACGATA 960

ATATCTGTCA TATTGTTTAA TGCATCGGCT CTTACAGCTG CAGAGTTAAA GACAAAACCC 1020

GTGACATACT TAACAATAGA TAAGATTATA TATACAATTA AACTCAAATA AGCACCGCGT 1080

TGCGCCAATT TAAGATTTTC ATTATGAGAC ATGCGTTGAA CCACCTTGAA TTAGTATAGT 1140

AACAATATTA TGAATGATTC ATTTTAATTT TACAACGTTT TTAATTTTTA TAAATTTTTA 1200

TAAAATTAAA CTAATTTATT CATTCGCAAC CCCTAAAAAT AATTTTTAGC CTTTCTGCGA 1260

ATTTTATGAG CTAGAAAGGC GCCCAACTCT CCCTGTTTGT TAACTTTCGC CTCGAAAGTT 1320
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CGAAwwTTAT GAGCTAGAAA GGCTTATGCA GTTGACGTTT TACGTCCAAC TGCGTTCCTC 1440 

CGTCTTCTTC AAATTTATTT GTnAGAAAGG CACCCAACTC TCCCTGTTTG TTAACTTTCG 1500 

CCTCGAAAGT TTCTATGTTA GAACCCTATG CATGAGTTGC GAAnTATCTA ATGTCGTGAA 1560 

CTAATTATAT AGAAGAAAAA GTGCATCAAT GACAAATTAA ATGAGATTTC TACTCTACCA 1620 

AACTCTCTTC GAAAGACAAT TTTCTCCTCT ATTTATTAGC AACTATTGCA TTTCTCCATA 1680

TAGTACTTCC TTACTTAAAA TACGCTGAAT GTCTGAATTA AA 1722 
(2) INFORMATION FOR SEQ ID NO: 408: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408: 

GCGAGACCCC CTGAGGGAGC AGTGCCCAGT CGAAGACCCG AGGCTGAGAC GGCACCCTAG 60 

GAAAGCGAcC ATTyCAATAC GAaTTgTGAt AAATAGAGAA CAGCAGTAAG ATATTTTCTA 120

ATTGAAAATT ATCTTACTGC TGTTTTTTTA GGGATTTATG TCCCAGCCTG TTTTTTGTGA 180 

TTTTTAATAA TTTGAATATG GAAAATGTAT TATTCTCTCA TTTGTATAOA TTGTATTTAA 240 

TAAGTTAATG TAATCCTTGA GCTCACGATT AATAAAATTC TATAACCTTA ATTATTTTCT 300

CGATACAAAG GGTTATTAAC TTTAATATAA GTATAATGAT GTGCCTCATC TTCAAGACGC 360 

ATTGTTGTAA CACACTTATA ATCTATAAAT GGCGCGAACA TGGTATCTTT AATTTCaTTT 420 

ATGCGATTCT CATTTACTTG ATTAGATTTG TGTGTCGAAA GTACAAGTTG ATCaAAAATG 480

TTATCTAGTA CATCACGAAC GATATACCAC ATATGTCTTT CTAAGTTTGA ATCATTTGAT 540 

GCTTTAGAGA TTGTAAGAAT TAATTCGCCT AAATGGTTTT GAACGGTAGA ATAAAAGGCT 600

TTGTTAAACA CAGACGTTTT TGAATCAGTA AGAATTCTTG ATTTTTCATG GAAATGAGAT 660 

GTACTGTATA CCATTTCATT TAGTTGTGCT TTATCAATTC TTAAACCTTC AAAGTCTCTA 720 

ATATACATCG TGTCCAATAG ACCATCTTTC CGAAATGTTG CAATAGCATT TTGCAAGTGA 780 

GCCTCTAATG CAATGCCATA TTTAGTAACT AGTGGGATTA CGAGACCAAG CAATGCTTTA 840

CTATAAGTTT CAACCCACGA TTTCGCCGAT GATTCAAAAT CAGACAATGA TGCAGCTGAT 900

TGATAACGTT TAATCAATGT CACGATAGGT GATTCATTGT TAAATGGGTA GGTTGCAACT 960

AAGCTTGAAG GAATCAATGG TGTGACTTCT TGTGGAATCA TTTGGTATAT ATTTTTTCTA 1020 

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AAAAATGAAT ACCAGCAACT TCATCAATAA 
TTGTTGATGC ATAGGACTTA AATATGACAT 1140

CTTTCTCCAA AATATCATTT AAAATACGTG TCATTAGTGG ACCATTGTGC GTCGTTTGTT 1200

CTGATAATGT ACGAATCTCA CCTGTAATAT GAACGTTTOT CGACAATTTG ATGTGTGGCG 1260

ACATAGCTGG GTATTTAGGA ACTAATGTTC TGAAAGATAA ACCAGCATAA TAATCCaACG 1320

TATGTTTTGC TTCAATGATT AATTCTTTAT CTACTTCTGC TTGATAATCA GAATGTAATA 1380

CGTCATCTAA TTGCCATGGA TGAACAATCA TAATGTGATA ATCATTAAAG 1440

GCGTAAATTC ATTTTCTAAT TGTTTAATTA X Avj 1 lXiATGA ACAGTTGTAT 1500

CATAATCTTT AGACAGTGAC ATAGTACGGC TTAATTTACT GTGAATCAGT ACTATTTTCA 1560

ACTTAATAGG TTGATTGAAT TCTGAAGAAT ATAGGAATGT TTGTAATGCA TTTAAACCTT 1620

TACGTAATTT AGCCCCAGGA TGTAGOGGAT GACCTTCAAT AACGGCTTGC TCTGAACGCA 1680

AGTAACTATC TTCGCTATTT TCGATAATAT TAAATAAAGG TGCAGAATCA TGTTGCATTG 1740

ACAGTGCTTG ATAGCTAATT GCAAATGTCA TATtAGTtGC ACTGTTTATT AAATCTTGCT 1800

GAAATTGATC ACTAGCAGCA TTTTTTAAAT CTGGTGCTTC AATTAAAATA CACTCAAGAA 1860

TTTCATTTGG ATGGTGTACT CGTGTAATCG TATTTGTAAT GTCATCTTTA ATGTAGAAAG 1920

GGCCTTCAAC ATCAATTCGA TCAAAGGCGT GTTCTCCAGT GATAGGAGCA TATAATGTTT 1980

GCTTAGCTTG TGGGAACTGG ATTTCTAGTA TATGAGTCGT TGAGATATCT AACATAATCA 2040

AATCACGACT CAATATTTTC TTACTTTGAG TGCGCGCTTT AACTAAGTTT TCGCGATGCA 2100

TTGATGTGAC CAATCTCTGA GTGACTTTAT CTCTTCCTTG TAAAATCATC TCTTTAAAAA 2160

TATTAGCCCA ATCGCTATTA TGTTGTTGTA AAAATAAATA TGTTTCTTGT TCTTCTTTGT 2220

TAAATTTTAA TGTCTGTTCT TTAAAAATTA AGTTCAAGTT CATAATTCAC CTCTATGAAA 2280

TATTTTACAA AAGCAAGATA GATTTGTATA ATCCATATTA ATGATAATGA yTCTTATTAT 2340

CAACAGAATG CGGGTGTAAG TTTTATGACA AAATATTTTT TTAGCAGTTC TTTTCTACTA 2400

TTTCTAGGTA ATTGGATTGG ACAAATAGGG CTAAATTGGT TTGTACTTAC CACTTATCAT 2460

AACGCAGTTT ATCTGGGGAT TGTCAATTTT TGCAGACTTG TACCAATATT ATTACTAAGT 2520

GTGTGGGCAG GGGCAATTGC CGATAAATAT GATAAAGGGC GATTGCTGAG AATTACAATT 2580

TCATCATCAT TTTTAGTAAC TGCAATTTTA TGTGTGCTCA CGTATAGTTc ACTGCAATTC 2640

CAATTAGCGT CATTATTATA TATGCGACAT TAAGAGGGAT TTTAAGTGCG GTTGAAACAC 2700

CTTTAAGACA AGCAATCTTA CCAGATTTAT CAGATAAAAT ATCTACTACA CAAGCTGTmw 2760

CATTTCATTC ATTCATCATT AATATTTGTC GTTCAATAGG GCCTGCCATT GCTGGTGTCA 2820
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CAGTTTTATT ATGCTTACCA TTACATTTTA AAOTAACTAA AATACCTGAA GaTGCATCAA 2940 

GaTACATGCC GTTAAAAGTT ATTATAGATT ACTTCAAATT ACATATGGAA GGTCGACAAA 3000

TATTTATAAC ATCATTATTG ATTATGGCGA CAGGTTTTTC ATATACGACA CTTTTACCAG 3060 

TTTTGACAAA CAAAGTATTT CCGGGGAAAT CTGAAATATT TGGTATCGCT ATGACGATGT 3120 

GTGCCATTGG TGGTATTATT GCAACGCTAG TTTTACCTAA AGTACTTAAA TATATTGGTA 3180 

TGGTAAATAT GTATTATTTA AGTTCATTTT TATTTGGCAT TGCTTTGTTA GGTGTGGTAT 3240 

TTCACAATAT TGTCATCATG TTCATTTGTA TTACATTGAT TGGGTTATTT AGTCAATGGG 3300

CACGTACGAC AAATCGCGTT TATTTTCAAA ATAATGTTAA AGATTATGAA CGTGGTAAAG 3360 

TACTGAGTAT TaTTATGATG GgATAGAGGT ATGaTTCCAT kGGGAAGTCn ATTAATGAGT 3420 

ATATGTGCAG ATGTGTTTGG CATTGTTAGA ACTTTTTCAA TAATGGGAAT AAGTACTATA 3480

TGCATTACAA TGGTATTCTA TTTTATAAAT AGAAAGTTGA AGTTAAAGTT GGAGGAAAGT 3540

AATCATGGTA TATCTTGAAT GGGCAAAGGC AGATAGAAAT ATTCAATATC GTGTAATTAA 3600 

CGCCATTATT AAAGAACGTA TTTACCCCGA GCAAACATTT ATTTCGCAAA AAGGATCTTT 3660 

AATTGAAATT CAGTATCATA TGCATGTGTT GACTATTGAA GTTGTTAGAA AAAGTGCATT 3720

AGAACGCTAT GAGTTTACAG GTGATATTAC TTATTTAAAT AAAGGTGAAA CGTCATTAAT 3780 

TATAACTTTA GAAGGTTTAT TAGATGTGTT GAATCATGAC TTTGATATCC CTATTTCAGA 3840 

GCGACTACGC GAAGAGTTAA TACACAGTCG AGATAGTTTA GTTGAAACAT ATAAGCAAAT 3900

GTCTCACAGA CAAACGTTAA TAAGTCmAAG TTTTAAATTT TCAAGGTTAC CACAAGATAT 3960

TAACTTTTTT TCakGGTtAC AACATGTAAA AGATAGTGAT AAGACAGATG ATTTAACTTA 4020 

TTCTGAGAGT TTGGTACCAG AGGGGCATCC AACACACCCT TTAACCAAAA CGAAATTGCC 4080

CTTAACTATG GAAGAAGTAC GAGCATATGC ACCTGAGTTT GAAAAAGAAA TCCCTTTGCA 4140 

AATTATGATG ATTGAAAAAG ACCATGTTGT GTGCACAGCT ATGGATGGTA ATGATCAATT 4200

TATTATTGAT GAAATAATTC CCGAATACTA CAATCAGATT CGTGTGTTTT TAAAGAGTTT 4260 

AGGTTTGAAA AGTGAAGACT ATAGAGCGAT TTTAGTACAT CCTTGGCAAT ATGATCATAC 4320 

GATAGGGAAA TATTTTGAAG CATGGnTTGC TAAAAAAATA TTAATTCCAA CGCCGTTTAC 4380

AATACTTcCA AAAGCaACTT aTCatTTaGG ACGATGTCTT TAATTGATAA ACcATACCAT 4440 

GTTAAgTTGC CCgTCGATGC aCAAGCAACA AGTGCCGTTA GAACAGTCTC AACTGTGACT 4500 

ACTGTAGATG GACCAAAGTT AAGTTATGCT TTACAAAACA TGTTGAATCa ATATCCaGGA 4560

TTTAAAGTTG CTATGGAACC GTTCGGTGAA TATGCAAATG TTGATAAAGA TAGGGCACGT 4620 

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AGTGCAAGTC 


TAGTTAATAA AJVATrPJUlTA [illegible] TTATCGTGGA TAGTTACTTA 4740

ATCAAGGAAT TACTAAAGAA [illegible] CATTTATTGA ACGATACGCT 4800

CAAGCATTAA [illegible] ATGGAATTGC TTTAGAAGCA 4860

CACATQCAAA ATACAGTAGT [illegible] GAACTTCVVV? [illegible] ACaTTCAATT TTTAGTGAGA 4920

[illegible] ATCGTGTATC AGATATTAAA 4980

[illegible] AGCTGATTCT [illegible] ATAGATGCAG TGATTGCAAA ATTCCAACAT 5040

[illegible] GGCAJGAATTA ATCCATCATT TTAATCAGTA TGATTGTGTT 5100

[illegible] AATTATTTAA CATAGTACAG CAAGTAGTAG CGCATGCCAT TAACCCAACA 5160






CAAATGAGx X 


AAAAGATATT 


TTGTTTGGAC 


CAACAATTAC 


TGTCAAAGCG 


5220 




TTGTTAAATA TQAGAATGGA AAATAAAGTA AAGCAATATT TAAATATTGA GTTAGATAAT 5280

CCGATAAAAA AAGAGGTGTA GTACTACATG GCACACGTTA ACATAAATAT ATCGAAGATT 5340

AAaTATAACG CCAAAGTACT TCAAACAGTT TTTCAAAGTA AAAATATGCA ATTCACACCA 5400

OTAATTAAGT GCATAGCTGG TGACCGTACA ATTGTAGAAA GCTTAAAAGC GTTAGGTATC 5460

AATCATGTTG CAGAATCCAG ATTGGATAAC ATAATTAGTA TTGCAGATAC AGGATTTAAC 5520

A 5521

(2) INFORMATION FOR SEQ ID NO: 409:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1261 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409:



AGGTCGTCTr GCaGmAGmGm TATTTGACGA TCTTGCTTTC CCAAAACACG ATGATGATTT 60

TAACATACTG TCTGATTATA TTGAGACACA TGGTGATTTC aCATTGCCAA TGTCTGTATT 120

TGATGATTTA TATGAAGAAT ATACGGAATG GCTAAAATTT TAATATAATT TTTAATAATA 180

ATAGTTAGAA CCAGGGTGAT GCAATTCGTT ATCCTGGTTT TAATTTAAAA TAAACTAAGT 240

TGTGACTAAA AATTAATCAA TTATAGTGAA ATATGGTGCG CTATCTTGCA TAAATTGATA 300

TGATTAACTA CACAGAATTT AAAAGTACAT AATACATAAT AAGGAAGTGA TACAATGGAT 360

GATAAGCAAC ACACATCTTC ATCCGATGAT GAACGCGCTG AAATTGCAAC AAGCAATCAA 420

GACCAAGAAA CTAATTCATC GAAACGCGTT CACTTAAAAC GTTGGCAATT CATATCAATA 480

CAAAAAATAA GTGGTTTAAA CAAAACTGAT CAAGCAAACT TAAATAAAAT TGAAAATGTG 600

TATAAAATCT TAAATAGTOA TTATTACAAA AAACAGGACT CTGACAAGTT AAGTAAAGCT 660

GCAATTGATG GCATGGTCAA AGAATTAAAA GATCCTTATT CTGAATATTT AACAAAAGAA 720

CAAACGAAAT CCTTTAATGA AGGTGTTTCA GGTGATTTTG TAGGTATTGG TGCAGAAATG 780

CAAAAGAAAA ATGATCAAAT TATGGTTACT AGTCCTATGA AGGGATCTCC AGCAGAACGT 840

GCTGGCATTC GTCCTAAAGA TGTCATTACT AAAGTAAATG GAAAATCAAT TAAAGGTAAA 900

GCATTAGATG AAGTTGTCAA AGATGTTCGT GGTAAAGAAA ACACTGAAGT CACTTTAACT 960

GTTCAACGAG GTAGTGAAGA AAAAGACGTT AAGATTAAAC GTGAAAAAAT TCATGTTAAA 1020

AGTGTTQAGT ATAAGAAAAA AGGTAAAGTT GGAGTTATTA CTATTAATAA ATTCCaGAAT 1080

GATACATCAG GTGAATTGAA AGATGCAGTT CTAAAAGCTC ACAAAGATGG TTTGAAAAAG 1140

ATTGTTTTAG ATTTAAGAAA TAATCCAGGT GGACTACTAG ATGAAGCTGT TAAAATGGCA 1200

AATATTTTTA TCGATAAAGG AAAAACTGTT GTTAAACTAG AnAAAGGTAA AGATACTGAA 1260

1261



(2) INFORMATION FOR SEQ ID NO: 410:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2488 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410:



AAATATATTG AAnAGAnAAT TACTAAGATT AAATCnTCTT AAAATATCCC TGAAATAACG 60

TCCTAAAGAT TAAAGGAAAG AGGTTATAAG TTATGCCAAA ATTAATTTTA TGTCGTCATG 120

GACAAAGCGA GTGGAATGCT AAAAACTTAT TTACTGGATG GGAAGATGTT AATTTATCTG 180

AACAAGGTAT TAATGAAGCG ACTAGAGCAG GTGAAAAAGT AAGAGAAAAT AACATTGCCA 240

TCGATGTAGC TTTTACATCG TTATTAACAC GTGCTTTAGA TACAACGCAT TATATTTTAA 300

CTGAATCTAA ACAACAATGG ATTCCTGTAT ATAAAAGCTG GCGTTTAAAT GAACGCCACT 360

ATGGTGGATT GCAAGGCTTA AATAAAGATG ATGCTAGAAA AGAATTTGGA GAAGAACAAG 420

TACATATTTG GCGTCGTTCT TATGATGTGA AACCACCTGC TGAAACCGAA GAACAACGTG 480

AAGCTTACTT AGCTGATCGT CGATATAATC ATTTAGATAA ACGTATGATG CCTTATTCTG 540

AAAGTCTGAA AGATACTTTA GTTCGAGTGA TACCATTTTG GACAGATCAT ATTTCACAAT 600
TTAAATATCT TGAAGATGTG TCAGATGAAG ATATCATTAA TTATGAAATT AAAACAGGTG 720

CACCX3CTTGT TTATGAATTA ACGQATGATT TAGAAGTTAT AGATAAATAC TACTTATAAA 780 

AaAAGAGCTG CATGTACACA AGGAGTQAGT GTATATGcAG CTCTTAAAtA TGTGAAGTAA 840 

TGTAAGGAAA TAGTTAAGTA TAGAGTTTAT ATTAACGAGC TAGGOATACT CGAAAATATA 900 

GTTAGACATA CAATATAGTC AAATTAAAAC AATTATTTCG CTCTTTTATG TTGCTTAATA 960 

ATCTTTAAAG CACGCTTTCT TGTTTTAATG TTAGGGCTAT TTAAATTACG ACGAGCAGTC 1020 

TGTAAATCTA ATTTCATCTC TATCCCTCCT TGTAAATATA TTATGACCGA TAACTACTCA 1080 

TATGTAAATA GTAATGATTA OGTTTTAAAG AAATTGTAAT AAAGTCGTGC TAATTTTTTG 1140 

GAAAATGGGT ATAATTACCG GATATCTAAA AATGTGTGTC GTTTTTTAGA TGGTGAGGGG 1200 

GAAGCTTTAA ATGTCGAAGA AACAAAAATT AACGATGATT ATTACTATGC TGATGGGTGG 1260 

ATTTTTTGGA TTATTAAATG AAACACTATT AGTGACGGCT TTACCAAGTA TTATGAAAGA 1320

TTTTGAAATT TCATATACAC AAGTTCAATG GCTGACAACA GCTTTTTTAT TGACTAATGG 1380 

GATTGTTATT CCTTTGTCCG CGCTTGTTAT ACAACGTTAT ACAACAAGAC AAGTGTTTTT 1440 

AGTGGGTATT TCTATCTTTT TCTTAGGTAC ATTACTCGGC GGCTTGAGTC CGCACTTTGC 1500

AACATTATTA GTTGCTAGAA TTATTCAGGC GTTAGGCGCA GGTATTATGA TGCCATTGAT 1560 

GATGACAACG ATTTTGGATG TTTTCCAACC ACATGAACGC GGTAAATATA TGGGGATATT 1620 

TGGTTTGGTA ATTGGTTTAG CACCAGCTAT TGGACCTACT CTTTCAGGTT ACCTTGTTGA 1680 

ATATTTTAAC TGGAGATCGC TTTTCCATGT TGTCX3CTCCA ATTGCAGCTG TGACATTTTT 1740

AATTGGaTTT AAAACGATAA AAAATGTTGG AACTACAATT AAAgTACCTA TTGATTTTAT 1800 

TTCTGTCATT TTTTCTGTAC TAGGTTTCGG CGGGTTATTG tATGGAACGA GTTCaATTTc 1860

AGAAAAAGGT TTTGATAATC CTAcGtATTA GTATCTATGA TTGGAGGCGT TGTTTTAGTC 1920 

GCATTATTTG TAwTACGTCA ATATCGGCTA TCAACACCAT TATTaAATTT TGCTGTATTT 1980 

AAAAATAAAC AATTTACAGT TGGTATCATT ATTATGGGTG TCaCAATGGT ATCGATGATT 2040 

GGTTCGGAAA CGATTTTACC TATCTTTGTG CAAAATTTAT TGCATCGTTC AGCTTTAGAT 2100 

TCTGGATTAA CTTTATTACC AGGAGCAATT GTTATGGCAT TTATOTCGAT GACTTCGGGT 2160

GCTTTATATG AAAAGTTTGG TCCTAGAAAT CTTGCTTTAG TAGGTATGGC GATTGTTGTT 2220 

ATTACTACGG CTTATTTTGT TGTAATGGAT GAACAAACAT CAACAATCAT GTTGGCAACA 2280 


GTTTATGCGA TTCGAATGGT TGGTATCGCG TTAGGATTAA TTCCAGTAAT GACCCATACG 2340 

ATGAATCAAT TAAAGCCAGA AATGAATGCA CATGGTTCAT CTATGACAAA CACAGTACAA 2400 


AACTTTTCTC CAACTATGTC AGACTATA 2488

(2) INFORMATION FOR SEQ ID NO: 411:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1105 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411:

TTTTACAATT TCAGATATCT CTAAAACACA TACCGATCCA ATGGCCAAAA TTATTCGTCA 60 

AAAATTGAAA AAACTAGGAA TTCGTAAAGG GATTCCAGTT GTATTTTCAG ATGAAAGTCC 120 

AATTGTCATA AGAGAAGATG TAAAAGATAT AGTTGGAGAT AAAAATGCTA TCAATCGAAA 180 

AGGACAAATG CCACCTTCTT CAAATGCCTT TGTGCCAAGT GTTGTTGGAT TAATTTGTGC 240

AAGTTATGTG GTGAATGACG TATTAAAAGA TATTCCAGTT CGTCGCATTA AAGACAAAGG 300 

GCAATAATTC ATTTTGAAAG GGATAATTTT CAACGTAAGG CAAGTGTAAC CACACCATAA 360 

AACTAAAACT GACTAGTTCG CAAGCATAGT AGAATATGCC TCGTGTACTA GTCAGTTTTG 420

ATTTGATTAT AGCTAAAATA TCAAAGGTAA TCGATTTAAT CTTATTTTTA CGGCCTTTGT 480 

TTAAGTAAGT TGTTATATAT TTCTTTGAGT TGTTGTTCAC TTTTAGATGT CGTTTTTGGT 540 

TCGTAATAAA TTTTGTTTTT TAGTTTATCA GGTAAATATT GCTGTGAAAC ATAGCCATTA 600

ACATATTGGT GTGGATATTT GTAACCAATA GATCGGCCTA GATCTTTAGC ACCTTGATAA 660 

TGTCCATCTT TTAAATGGTT TGGAATTTGG CCCACATGAC CGTTTCTAAT ATCGGACAAT 720 

GCACTATCAA TGGCACTCAT TGCTGAATTT GACTTAGGTG ATAAGCATAG TTCGATTACT 780 

GCTTGGCTTA GTGGAATTCT AGCTTCTGGT AAACCTAGAC GTTCTGCTGA TTCAATAGCA 840 

GCAAGTGTTC TCTGACCAGC ATTAGGTGAG GCTAAGCCTA TATCCTCATA GCTAATTACA 900 

AGTAATOGTC GAACTATTGT AGGTAAATCT CCAGCTTCAA TTAATCGTGC TAAATAATGT 960 

AAAGCGGCAT TGACGTCGCT ACCACGGATA GATTTTTGGA AAGOGCTCAT AACATCGTAA 1020 

TGCATGTCAC CATCCTTGTC ACTTACAAAT GCACCTTTTT GTAAACAGTC TTTAGCATCT 1080

TGCAATGTAA CATGTCGATA ACCGT 1105

(2) INFORMATION FOR SEQ ID NO: 412:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 579 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412:

TAACGTCGAA CTTGAGCTGT TACGTTATGA CTCATAATTA TTTTAGCATA GTCATTTAAA 60 

TAAACTTCTG TTCTGTCTGT TGGATAAGCA AATTCAAGCA ATTGACTGTA ACTATCATTA 120 

ATGACTTCTT GATCAACATG ACTATCAAAA TATACAGCAT AATAATAAGT ACCATCAACC 180 

ATATATAACA AATCTTCAAA CTCTGTAGTT ATTGGATTGC TATGATATGC ATAATTAATA 240 

ACATCTTCTA AATCGTTAAA TTTCACAATG ATTGTTCTTG TATTTTTACG TGCTGAAGAC 300 

TTTTGACGTT TAGAACCTTG AGCTTCTTTT TCTTTTGTTC GTTGCTCGAA TAATTCTTCT 360 

AATTGATCTT CACCTTCTAA TGTTTGAGCT AACAATTCTT GAACTTGTTC ATCAAATkGA 420

TCAGTTGCAT CATCATCAGA CATATTCATC ATATCTTCAT TTTTAGATTT AGAAATTGTG 480 

ACTTCGACAC CTTTTTCAAA GGCATGTACT TGAATCCATA ATGGACCTnC TACAACAAAA 540 

TCTTCTACTT CGTTAATTTC ATCCATCATT GAnCAAAAG 579 

(2) INFORMATION FOR SEQ ID NO: 413:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1342 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413:

GTGGTGAAAC TTCTTOCTTT GTAATTTTAT AAAGTGGATC AATATCACTC TGACTTACAT 60

CAGGTTGAAC TGTCATCTTT TTAGTTACTT TGTTTTCAAG CGTAATATTT GCTAATTcTA 120

GGtTTTACAG AATAATGAAT GTCATCAGCT AmCCCTTTmm CtTGATACTy CTCTTGATAT 180

TTTCCAGCTT TAGCATTTGA TAAATCAATC ACTACTCTTA AATCTTCTGG ATTTTCAATT 240

TTTATTATCT TTGATTGTGG TCCTGAAATA GTCACATTAA CTGTTTCAGG CGCTTTTGTT 300

AAATGCAAAT CTTTAGTGTT ATAAAGAATT TCAACGGGTA CATCTTGAAT CGTTTTACTA 360

GACTTTTGAC CAAGATTACC AGTGTTAAAG ATATTTCCAA AAACATTGTT AACAGATAAA 420

AAGAAAAACA ATGCCAAAAG AAAGGCAATA AATCTCAAGC CCCATTTACT TTCTAGCATA 480

TTATTTCACA CCTTTCTTTT GAAAGCGTGT GCCAAACCAA TGTTCAGCAA GCAACTCTTC 540

AAAAATTTCG TTTGAAATGT CTCGTCGTAA TTTTCCATCA AATGTTACCG AAATATCACC 600

AGTTTCTTCA GATACAATAA CGGTAAATGC ATCAGATACT TCTGAAATAC CAACCGCAGC 660 
TGCTGCTGCT GCAATCTTCG TGCCTTGAAT AATCATTGCA CCATCATGTA AAGGTGTGTT 780

AGGTATAAAG ACATTAATTA AAAGTTCTTG CGAAATATTT GAATCCATTG CAATACCTGT 840

TTCAATATAA TCTTGAAGAC CTGTTTCTTT TTCAAAGACA ATTAATGCAC CTATACGTCT 900

TTTAGCCATA TATTGCACAG CCTTTGAAAC CGATTGAATC AATTTCTCTT CATCTTTACT 960

ATACGTATTA GAAGTATAGC GTTTTAAAAA GCTACCTCTA CCAAGTTGTT CTAACGCACG 1020

TCTAATTTCT GGTTGGAATA TTACTATTAA AGCTAATACC CCCCATTOAA TAACGATATC 1080

GAATAATTTA GATGTTGCAG TCAAGTTCAA TATCATACTT ATCTGCTGAC CAATAACAAT 1140

TACTAATATC CCTTTAAGTA ATTGTATCGC TTTAGTTCCC TTAAAOACCG TGATGAGAAG 1200

ATAAAGTACA TACCAAACTA TCAGTAAATC AAGGATACTC GTTACAATTT TTAACGTACT 1260

GAGGTTTTGA AAAAAGTTGG AAAAATCCAT AACATCTCCT CCGGGTAATA TTTTTCCATA 1320

ATACCCATTA TACCAATCAT TT 1342

(2) INFORMATION FOR SEQ ID NO: 414:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1073 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414:



CTGTATAAAG ATGGAGGTGT TTTGCATGGT AAAACGTACT TATCAACCAA ATAAACGTAA 60

ACATAGTAAA GTTCATGGTT TCAGAAAACG CATGAGCACA AAAAATGGCC GTAAAGTTTT 120

AGCGCGCCGT CGTCGTAAAG GCCGTAAAGT TTTATCTGCA TAAGATCACT GACTCATCAG 180

TGAtCTkTTT TTTCGTTTAA ATTAAGAATA AATAGAAATT TATGTTATAA GCTCAATAGA 240

AGTTTAAATA TAGCTTCAnA TAAAAACGAT AnATAAGCGA GTGATGTTAT TGGAAAAAGC 300

TTACCGAATT AAAAAGAATG CAGATTTTCA GAGAATATAT AAAAAAGGTC ATTCTGTAGC 360

CAACAGACAA TTTGTTGTAT ACACTTGTAA TAATAAAGAA ATAGACCATT TTCGCTTAGG 420

TATTAGTGTT TCTAAAAAAC TAGGTAATGC AGTGTTAAGA AACAAGATTA AnAGAGCAAT 480

ACGTGAAAAT TTCAAAGTAC ATAAGTCGCA TATATTGGCC AAAGATATTA TTGTAATAGC 540

AAGACAGCCA GCTAAAGATA TGACGACTTT ACAAATACAG AATAGTCTTG AGCACGTACT 600

TAAAATTGCC AAAGTTTTTA ATAAAnAGAT TAAGTAAGGn TAGGGTAGGG GAAGGAAAAC 660

ATTAACCACT CAACACATCC CGAAGTCTTA CCTCAGACAA ACGTAAGACT GACCTTAGGG 720

TTTAGATACA ATTACGAGTA TTTCAACACC AATGGGTGAA CGOGCAATTG GAATTGTTCG 840
ATTGTCTGGA CCGCAAGCCG TTOAAATTGC TGACAAATTA TATAAAGGAA AACATCTTTT 900 


AAATGATGTT CCATCACATA CGATTAATTA CGOTCATATT ATTGATCCAG tCTAAAGAAG 960 
TGGTTGAAGA AGTTATGGTG TCTGTGTtAA GAGCGCCAAr AACATTTACA CGCGArGATA 1020 
TTATAGAgAT TAATTGTCAT GGTGGtATTk TAACGATTAA TAGAGTGCTG GGA 1073 


(2) INFORMATION FOR SEQ ID NO: 415:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3176 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415:





CTTTACCAAT GCCAAATCCG AAGTAAAGTA TAGCAATAAA GATTACTAAT ACAATTCTGT 60

AAATGGCAAA TGGAATTAGT TTGATTTTGT TAATTAGATG CAAGAATGTT TTGATTGCAA 120

TTAGTCCAAC AGTAAATGCA GCTAAAAAGC CTAAAATATA AAAAGGTATA TCAGCAATCT 180

GAATATCTTG ATAATGTTTT AATAAAGATA AACCACTAGC TGCTAACATA ATTGGAACAG 240

CCATAATAAA TGTAAAGTCC GATGCTGCTT TATGATTTAA TTTCATTAAT ACCCCAGTTG 300

AAATTGTTGA GCCTGAACGG CTGAAACCAG GCCACATAGC TACTGCTTGA gAAATACCAA 360

TTACAAATGC TTGGAAATAA CTGATTTGAT CTACTGTTTG TGGGTTTTTA ACTTTAGCTG 420

AGTATTTATC AGCAATAATC ATATAGATAG CACCTACGAA TAAGCCAATC ATAACAGTTG 480

GCACACTAAA TAAATGTTCT TCGATGAAAT CATCAAATAG TAAGCCTAAA ATACCTGCTG 540

GCACCATACC CACTAATACA TGTAATAAAT TTAAACGTCT TGGCTTTGAA CGTCTTTGTT 600

GATCGTTATC TCCTTCAACA TGTTTGTGTT TACCAATATG TAAAATCTCT AAGAAGOGTT 660

CGCGGAACAC CCATGCTGCT GCAAAGACGG ATCCTAATTG OATGACGATT TTAAATGTAA 720

ATGCTGACTG AGAACCTAAA AATTCAGATG ATTTTAACCA CATATCATCA ACTAGGATCA 780

TATGTCCAGT AGAGGAAACA GGTGCAAATT CTGTTAATCC TTCGACGACC CCTAAGATAA 840

TACCTTTTaT TAATTCAATG ATAAACATAA TGTACCCACT TTCATTACTC AATTTAATTT 900

ATTTAAATAT CAAAATTACC ATATCATGAT AGCATATTCA TTTAAAGACA TGCTAGTTAT 960

AGTTATAATA CTAGACTAAA GATGTATATA TTCATTTTCT TTTACATGTA AAACTACAAT 1020

ATTTTATTGA GCTATTTAAT TTGATTTTAA GGAAAACCTT TTATAATAGG TTTAGGTGAT 1080

TTCTTGGTCA GTACTGGTCT CGGCATAATC GTTATAACGC AAAATATTTT AATAGCAGAT 1200

TTTTTAGCTA AAATTATAAG ACATCAATTT CAAGGTTTAT GGATTGTATT ATTTATTTTA 1260

TTAGGTGTTT TACTTTTAAG AGCAACTGTG CAATTTCTAA ATCAATGGTT AGGTGATACA 1320

TTAGCATTTA AAGTTAAGCA TATGCTTAGA CAGCGGGTTA TTTATAAAAA TAATGGTCAT 1380

CCAATCGGTG AACAAATGAC TATACTCACA GAAAACATTG ATGGTCTAGC ACCTTTTTAT 1440

AAGAGTTATT TGCCTCAAGT GTTCAAATCA ATGATGGTTC CGCTCATCAT AATCATTGCA 1500

ATGTTTTTCA TCCATTTCAA TACCGCATTA ATTATGTTAA TAACTGCACC ATTTATTCCT 1560

TTGTTTTATA TTATTTTCGG TTTGAAAACG CGAGATGAGT CAAAAGATCA AATGACTTAT 1620

TTGAATCAAT TTAGTCAACG GTTTTTAAAT ATTGCTAAAG GTTTAGTGAC GTTAAAGCTA 1680

TTTAATCGTA CAGAGCAAAC AGAGAAGCaT ATTTaCGACG ATAGTACTCa GTTTAGAACT 1740

TTAACAATGC GCATTTTaCG CAGTGCTTTT TTATCGGGAT TAATGCTCGA ATTTATAAGT 1800

ATGTTAGGTA TTGGATTGGT TGCATTGGAA GCAACGCTAA GCTTAGTAGT ATTTCATAAT 1860

ATTGATTTTA AAACTGCGGC AATTGCGATT ATTTTAGCGC CTGAATTTTA TAATGCAATT 1920

AAGGACTTAG GGCAAGCGTT CCATACTGGA AAACAAAGTG AAGGTGCCAG TGACGTTGTG 1980

TTTGAGTTTT TAGAACAACC GAACTATAAT AATGAATTTC TATTAAAGTA TGAGGAAAAC 2040

CAAAAGCCAT TTATTCAGTT AACAGACATA TCATTTCGAT ATGATGATTC TGATAGATTG 2100

GTATTAAATG ATTTAAATTT GGAAATATTT AAAGGTGATC AAATTGCACT TGTAGGTCCA 2160

AGCGGGGCAG GTAAATCCAC TTTGACACAT CTTATTGCAG GTGTTTATCA GCCAACAATA 2220

GGTACTATAA GTACAAACCA GCGTGATTTA AATATAGGAA TACTTAGTCA ACAGCCATAT 2280

ATTTTCAGTG CTTCTATAAA AGAGAATATT ACGATGTTTA AAGATATAGA AAATAATACT 2340

ATTGAAGAAG TGCTAGACGA AGTAGGTTTA TTAGACAAAG TGCAATCTTT CACAAAAGGC 2400

ATTAACACAA TAATAGGTGA AGGAGGCGAA ATGTTATCTG GTGGACAGAT GAGACGCATA 2460

GAACTTTGCC GTCTTTTAGT TATGAAGCCA GATCTCGTTA TATTTGATGA GCCTGCAACT 2520

GGTTTAGATA TTCAAACAGA ACACATGATT CAGAACGTTC TGTTTCAACA TTTTAAAGAT 2580

ACAACGATGA TTGTCATTGC ACATAGAGAT AATACAATTC GCCATTTACA ACGACGCTTG 2640

TATATAGAAA ATGGAAGACT GATTGCTGAT GATCGCAATA TTTCAGTAAA TATAACAGAA 2700

AATGGTGATG ACTTATGAAA ACACGACTAA AATTTCAAGT AGATAAGGAT TTATTGTTAG 2760

CTATAGTTGT TGGTGTTTGT GGAAGTTTAG TTGCGCTCGC CATGTTTTTC TTAAGTGGTT 2820

ATATGGTGAC ACAAAGTGCA CTTGGTGCGC CACTATACGC TCTGATGATT TTAGTCGTTA 2880

ATAAAGCTAC ATTTACAATG CTACGTGATA TTCGGGTACA GTTTTTCGGT AAATTAGTAA 3000

ATGTCATTCC TAATGTTTAC CGTAAACTGA GTTCTAGTGA TTTAATTTCA CGTATGATTA 3060

GTCGTGTTGA GGCATTACAA AATATAkATT TACGTGTTTA TTATCCACCA GTCGTCATCG 3120

GTTTQACAQC GCTAGTTACA GTCATAGTTT TGGCGTTCAT TTCAATCGGC CATGCG 3176


(2) INFORMATION FOR SEQ ID NO: 416:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2109 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416:


TCTTTATTTC AATTTCCAAT TGAATTTTTT TATTATTTAC GCATAGCTCT TAAAATTAAC 60

GTTACGATTG CAATTAAGAT AATTGAACCA ATTAATGCTG GCAAGATGTA AATACtTCCT 120

AATTCAGGAC CCCATTGTCC GAATAGTTTG CCACCTACCC ATGATCCAAT AATACCTGCG 180

ATAATATTGC CTAAAATACC ACCTGGGATA TCTTTACCCA TAATAGCACC AGCAGCCCAT 240

CCAATTAAGC CACCGACAAT TAACATTCCA ATAAATCCAA ACATAATTTT CAGTCTCCTT 300

TTTCTATTTA TTTTGCGTTA TTCTAAGTAG TACCCCTTAT TTACAATTCT AAAACAAATT 360

CAAATTATTT TTATCCAAAT ATTTTTAAAA GTAGTAATTG AATATCAATT TTATTCAATG 420

TAGCTATCGT TATTTAAAGT CTCTGTACCG ATAATATCAT ATACATTTAC ATTAiTrrrr 480

CTGCCGAATT CATAGCTTGA TTATTTTATG TTATAGGACT AgAATATACA CATATTATTA 540

GAGCATCTTT GAATTTTAAA TCAAGAAGCG AGGTTAATGA ACAATGAATA TGCATATTTT 600

ATATaACTTA CGAACTAAAC ATAATTTAGA AATTGACGAA TTAGCACAGC [illegible] 660

GAAATATGGT ACTAAATATG AAGCACATCA AATTTGGGAA TGGGAGAATC 720

ACCTAAATTT AAAGATGCCA TGCATTTAGC TGACTtCTTT GATGCACCAT [illegible] 780

TTTAGAAAGT AAGGTTAAAG AATATCAGAA ACATTTAGAA GAAGTCGATA TTCGCATGGA 840

TAAATAGATG CAAATAAACC CTCACAACAC GTTTGGCATA TATCCTTTCA AATCTATACT 900

GGATATATTA CATTACGTTG TGAGGGTATT TTATTAATTA ATATGAATTA AGACATTTTA 960

CAAGCGTTAA TGCAACGAAT CTTTTTAGTG ATCTTGCTCA CTCTTTAATA CTTTACCGTT 1020

CTTAGCATCA ACAGTAACTT CTTGTTTTTT ATTACCTTTT TTCAAATCGA TATTGTAAAC 1080

AAGTTTGCCA TCATCTTTTT CAAGTGACCA TTCTTTAATA TCACCATCAA ATTCTTTTTG 1140

ATTCATTGTA TCTTCTTTTT CAGTCTTTTT GTTAATCACT TTTTTATTTT TATCAGCAAC 1260

AAGTACTTCT GACTCTTCAC CAGATTTTTG TTGCGTCACT TTATAAGCCC ATTCACCATT 1320

AGAATTTTCA AATGAAATTC CTTTCAACTT TTGGCCTTTG TAAGTTTCTT CAGCTTTTTT 1380

CACAGCATCT TCTGGGCTTG TTTTAACATC TTTTAAAGCA ATAACATCTT TTGTTGTATT 1440

AGTGTCTTGA TTAGTATTTG ACTCTGTTGA TTTTGTTTCA TCTTTTGGAG TATCATTGCC 1500

ACATGCAGTA AGCACCACCG CTGACATTGA TAACACTGCT AATGATTTTA ATTTCATAAT 1560

ATCACTCTCT CTTCCTATTT TTGAAACTCA TAACAAAAGC TTATATGCTA TATAGATTGT 1620

ATTACCCCTT GTTTTTAATT TTATTCATAA TTATTACAAA TATTTTTAAA TTAATCGTCA 1680

TQCOTTACTT TCGTTCGTAC TCCTTTTATA AATGAACCAT GTAATATAAG CATGCTATTA 1740

TTGTCTAATG CTTTCTTTGG TACTTCTTCC TTTTCAACTT CTTCAAAAGT 1800

TTCTACATGA TGACCTTTAT GTGTAATTTT CAAGTATCTA TGCGGTTTAA CATCAAATGT 1860

AGCAGTATAC ATTAATTGCG TCTCTTTCCC TTCTTTATTA AAAACACTTT GACTATAACT 1920

TCGGAGTTGA TCATCCATTC TAGTAGACAC CTCTGTCGTT TTAACATAAG AATCATCCTT 1980

CTTTACTAAT GGGTTAAATT GATCTGTTAT ACCATGGGAG TCTATTGTTT TTAAAATGAA 2040

TAAAGCAgCA TAAGCGCCAA TGATAGTAAG TACAAGATAt GTkATTGTTT TTAAAATCGT 2100

TTCAAAAGA 2109



(2) INFORMATION FOR SEQ ID NO: 417:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 813 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417:

GTTACATAAA TGAAAACAAA AAAGATAATT TTACTGCTTA TGCTACACCA GAACATAATT 60 

ATCAATTTGG TGGTGCTATG ATAGAAAGTG AAAAATTAAG CGAGTTACTA AAGCCAGCCA 120 

ATCAGTTAAA ATCACCAGAT GATATAAAAA AAGAACTAAA TAAAAAGAAG AGTCACTAAA 180

GTTAGGAGTT ACTTTAATGT CCAAAAAACA TGTTTTTATA ATTATTGGTG TCATATTGTG 240

TATATGTACA GTTTCTACGG TCATGCATTT TAAAATGAAA TATGATGAAA AAGAAAAACA 300

AAAAGCGATT TACTACAAAG AACAACAAGA ACGTATTACA CTCTATCTTA AGCATAATAC 360

TAAAGAAACG AACACGATTA AATCTGTACA TTTCACAAAC TTGGAAACAA GTCCTATGGG 420 

ATCGCCTGAA CATAATTATC AATTTGGTGG CGCTATGATA AAAAGTGAAG GAGTAGATAA 540 

ATTATTAAAA CCAGCACATG AAAGAAAATC ACCAGAAAAA ATCAAAGAAG AATTAGATAA 600 

AAAAGAAGGC CACTAGGGTC TTCTTTATTT TTGATTTAAT CTTCCAATAA TCTATGTCAT 660 

TGCTATCGAA GGTGTTTCGC AATTAATATA AATCACTTCA TCATCACCAA TACTTCCCCA 720 

GTTTTGTACA GTACATTAAC ACAAACGAAC CACGTTAATT TAAATGGAvT AaTAGTTTGG 780 

CCATTATAAG AACAATATAT ATCGAnTAAC AAT 813 
(2) INFORMATION FOR SEQ ID NO: 418: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 640 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418: 

AAAAGCAATC GTTGGTGGTG CTAAATTCAT CGGCAACTCA TATGTAAAAG CTGGTCAAAA 60 

TACACTTTAC AAAATGAGAT GGAATCCTGC ACATCCAGGA ACACACCAAT ATGCTACAGA 120

TGTAGATTGG GCTAACATCA ATGCTAAAAT CATCAAAGGC TACTATGATA AAATTGGCGA 180

AGTCGGCAAA TACTTCGACA TCCCACAATA TAAATAAGCA ACATGAACAT AGGATCAAAA 240

GTCATCCCCC ACTATCAATC ATGGGGGATG ACCTTTGATC CCTTTTTTAT ACATACACAA 300

GCAAAAATAG CGGTGATTGT TTACCATCAA TTTTAACAAT CACCGCTACT TTTGCTTGTA 360

ATTCATGATT CAATTTTTGT TGTGTGCACA ACGACACTAA ATTATGTGTT TGCTATTGTC 420

GTGTTACAAC GATATGCGTC GTTGATTTAA CTTATCAAGT AATTGATTTA AATTGTCTAA 480

TTCGACTTCC GATAAACATT GACATCTTGC TTCAATCAAT TCGCAACGTG CATTATTTAT 540 

TTGTGAAATT AATGTACGTG CTTGATCAGT CAAAATTAAT TCTTTACATC TTAAATCTTC 600 

GCTAGATTGT TGACTTGTGA TGTACCCnnT CAAAACTAAT 640 

(2) INFORMATION FOR SEQ ID NO: 419:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1159 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419:



TTCTTGGATA CGTAATAAAT TACCTTTTGT CATAGGATTA AACATAGATC CAGGTAAAAC 120

ATAAACATTT CTATTTTGCT CTAATGCAAA ATCTATCGTG ATGTGACTGC CACTTTGTTC 180

CTTAGCCTCA GTAATTAAAA CCCCTTTTGA CAAACCGCTG ATAATTCTAT TGCGCTCAGG 240

AAATCTATAT TTAGCGATTG GTGTATGTGO TGGATATTCA GATATAACTA AACCTTTTTC 300

TTCTATTTTA TTTCTTAATG CTAATGTACT TTTGGGATAA TGTGTTTGAT GGCCAAAGGC 360

TAAAACTGCA ATTGTAGGGA GATTGTATTT TAAAGCTATT TGATGTGCCA TTGCATCAGC 420

TCCTTGAGCA AGGCCGGAAA CAATTGTTAA ATATTTGCTT TTATCATTTG ATAATAAAAA 480

TTCTAAAGAC TGTTGGGTAT AACTTGTAGA ATCTCTTGCA CCTACTACTG CCAAATGATG 540

CATATTATTT ATTAATTTGA TGTTCCCTTT ATAGAAAAGA AGTAATGGAA ATTGATATAT 600

TTCTTTTAAT AGCACTGGGT ATTCAGAATC CATATATGTA ATGTAACTTA CTTTTAATTT 660

CTGCAGTTCT TTAATAATAT CGTTATGATC AAGTTTTATA AAAGCATAGT ATTTACGTAA 720

TAAATGAACA TTTTCTTCCC TATTCACCCA TTCACATAAA TAACTATCTT TTTTTCTTCC 780

CTCCTCTTTA ATTACATTAG GATATGCCAT TAAAAATTGA TGAATTTGTT TAGTCGAAAA 840

GTGTGCCCAG [illegible] [illegible] [illegible] [illegible] [illegible] 900

TCATACAAAA TCATATTCTA TTTTTGTTTT ACATTTCTAA TACAAAAACA TTGTCGATGT 960

AATGTTATTT TAAGGAGTAA AAATACTGAC TTAAAAAGTG AAAAGTATGT TGGAAAGAAT 1020

TTAAAATTTC AATTTCCAAC ATACTTTATA ATTAAACCTT ATAAATAAGT TTTGCAAATT 1080

TATATAGAAT TGGTCTTACT GGTTTGATGA AATCACCAAT TAATTCTTCA ACATGTGCAT 1140

TAAAACCCTT TTTAAATTG 1159



(2) INFORMATION FOR SEQ ID NO: 420:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1879 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420:

TCAATCAGCG ACTACAAGAA GTGCGGGTCT TCAAACAATT GATGTGACAA CACTAAGTGA 60

CCCCACTAAT ATTATCATGG GTATTTTAAT GTTTATAGGA TCTTCGCCAA GTTCGGTTGG 120

TGGCGGTATT CGTACAACAA CTTTCGCTAT TTTAATTTTG TTTTTAATTA ACTTTAGTAA 180

TAATGCCGAT AAAACATCCA TTAAAGTTTA CAATAGAGAA GTACACATTA TGGATATTCA 240

GAAAATGGTA AGCTTACATT TTTACAAGTA tttittuaag tcatgtctgc 360

ATTTGGAACT TGTGGACTAT CGCTTGGTGT CACAAGTGAT ATTAGTGATA TTTCTAAGGT 420

ATACTCATGT TTATAGGACG TGTTGGCTTA ATATCATTTA TCATTATGAT 480

AGCAGGACGT CGAGAACCAG ATAAATTCCA TTATCCAAAA GAACGTATTC AAATAGGATA 540

ATATAATAGC AATCTAAGTT TAGTTAATGT AGATTTTAAC TGGAACTTAG ATTGCTTTTT 600

TAGTTTGTAT TTTTAACTTA TTTTATAAGA CGATTGGTTT CGAAAATGGT AAACTAGTAA 660

CAATGAGAGG TGTAACATGA TGGAAAAAAA TGAAAACATT AATGTAGAGA TTTTAACTAC 720

GTCAGATATG CATAGTCATT TCTTAAATGG TGATTATGGT TCAAATATTT ATAGAGCTGG 780

TACTTATGTT AACCAAGTAA GAGCACAAAA TCATCGCGTC ATTTTATTAG ATAGTGGCGG 840

AAGTTTAGCT GGCTCGTTAG CGGCCTATTA TTATGCTATT GTTGCACCTT ATAAACGACA 900

TCCAATGATA AAGTTAATGA ACAGAATGCA TTATGATGCT AGCGGTGTGA GTCCAAGTGA 960

ATTCAAGTTT GGTTTATCAT TTTTAACTCG TTCAATTGCT TTGGCACGTT TTCCATGGTT 1020

ATCAGCAAAT ATTOAATACA ATGTTACTAA GGAGCCTTAT TTTTCAACTC CATATTGTAT 1080

TAAACATTTT GGTGACTTAA AAATTGCTAT CGTAGGCGTC ACAGCAGATG GTTTAATGGA 1140

AAATGAGTAT TCTGAAATGG AGCAAGATGT ATCTATTGAA AAGACATTAG TGGCATCAAA 1200

ACGTTGGATT AGATATATCC ATGAAGTTGA AGAGCCAGAT TTTTTGATTG TAATTTATCA 1260

TGGTGGATTG AATAAAATTA GTAATAGTAC GAAAAATAAA AAGGCAAGTT CGAATGAAGC 1320

TGAAAAATTA ATGGAAGAAC TCGGTGTTAT AGATTTAATG ATTACAGCTC ATCAGCATCA 1380

AACAATAGTA GGTCAAGATC ATGAAACGTA TTATGTTCAG GCTGGTCAAG ATGCCAAAGA 1440

GCTTGTACAT CTTTCGATTA ATTTTAAAAA GAGAACAACA ACTTATGATG TTGAAAGCAT 1500

TGATTCTAAA GTGATTGACT TAAATGAGTA TGAAGAGGAT CAAGAATTAT TAGATTTAAC 1560

ATTCTATGAT AGAAAAGCAG TGGCTTATTG GTCACAGGAA ATCATAAGTG ATAAAGGTTT 1620

GATGTTATCA GTAAATGGGT TACAAGATTT AGTCTGTCAA ACACATCCAT TTTCGCAATT 1680

ATTACATGAT GCAATTCACC TTGCATTTGA TAATGATATA ACATGTGTCC ACGTGCCTAT 1740

GaACGGAGAG AAGGGGTTGA GTGGACAGAT TOGAAATGAA GrTTTGTATC aTGCATACCC 1800

ATaTCCAGAT aAGCCAtGGG tATGACAATT aGTGGTCAAA ATaTCAAGGT ATnTTGGGGT 1860

ATAGTTATTC ACCATTAGG 1879


(2) INFORMATION FOR SEQ ID NO: 421:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2710 base pairs
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421:

AATTCCCCCT TTTTCCCCAT CCATTTTnCC ATCCACTGGT CCTTACGGGA CCATATTATT 60 

TnAAATTGGA nAAAATTATt TTAAAGaTTA TwACtACTCT TtAATCATTT TrGTGAATTA 120 

AAAAAAGTAG TGCAAAAAGC AAAATATACT TTATACACTA CAAATCATTT ATTTATAATA 180 

AAGTTTCACC AAAAAATGTT CCAACTAATG AAACCGCTTG TTCAGCAGTA TGATTATTAC 240 

TGTCAATCAA TGGATTTACT TCAACTAAAT CCATTGAGGA AATTAAATGT GATTGATGCA 300 

GTAATTCCAA TGCAAAATGG CTTTCTCTAT AACTAAGACC ACCCAAAACT CTAGTACCAG 360

TGCCTGGCGT TTCAAGCGGA TCTAAAGCAT CAACATCTAA AGATAAATGA ACGCCATCAA 420

CATTGCGTGA CTTCAAATAT TCTATTGTAT TTTCAATTAC TTCCTTTATC CCCAATTTAT 480

CAATATCTGA CATAGTAAAT GTTTTAATAT TATGATCTTT GATAAATTGT CTTTCACCTT 540

TATCTAAATC TCTCATACCA ATTAGTACGA TGnTTTCTGG CTTGATTACA TTACnATTTA 600

ATTCTAAAAG TTCTTTGGGA CCTTCGCCTG TCAAAATCCT TAGAGGCATA CCATGAATAT 660

TTCCACTTOG TGACTCTTCA GGTATATTTA AATCACCATG TGCATCATAC CAAATAACAC 720

CTAAATTATT ATAATGTTTA CTTATTGCTG ATACTGAACC TACCGCAATA GAATGATCTC 780

CACCAAGAAC TAGAGGAAAT CTGTTATTTT CAATTGATGC TGAAACCTCT TTATTTAATT 840

TTTGATTAAC ATCTATAATT TCATCATAAT TTCTTAATCC TTTTTGTTCA CTATGAAATT 900

TTTCAATGTT CACAGCAGGt ACCTTAATAT CCCCCTTGTC ATATACATCA AGGTCTAATT 960 

GCTTTAATCT TGAAATTAAT CCAGCATATC TAATTGCTGT TGGTCCTAAA TCAACACCTA 1020 

ATTTTCTTTG TCCAAATGTT GATGGTGCAC CTATAATATC AATTGCTTTT GTCTTTGTCA 1080 

TAAGCGTCCC CCTTTGCTCT ATGTAATTAA AGAATAATGT ATGCGCTTAC CATTATCAAG 1140 

CAATAGCTAC ACATATAATC TGTTTATCTT ATTACTTCAT AAAAAAAGGT TCTTCATCTT 1200 

TTATGGTGGG AAGGTAAAAC TTCCTGCTTT TTTTAATACA CAAAAAGCGC AATTGCCTCT 1260 

ATAATTTAAA GTGACCAAAC CCAAACTAAA GGAGACAAGT GCGCCTATGT GTAATGATAC 1320 

CTTAGAATTA CTAAGAATAA AAGATGAAAA TATAAAATAT ATAAACCAAG AAATTGACGT 1380

CATTATCAAA GGAAAAAAAG CAACAGTGGT TAATGCTGTA CTAACGTATA AGCCTTCGGC 1440 

CTGTTATTGT TGTGGAGTTA AAAATGAAGG ACAAATTCAT AAACATGGTA AGCGTGTTTC 1500

TCGTATTACT TTACTTAAAA CTCAAGGGTA TAACACATAC CTCAACTTAG CTAAACAACG 1560

GTGTTTTATC TCAAGATGTG TTACTCAAAA AGTTATAGAA GAAOCTACTA AAGTTAAAAC 1680

AGAGATTGAT ACTGCAGAAG ATAACTGTAT CTCTCCATCT ACTGTAAGTC GTATTAGAAC 1740 

TAAAGCGGCT AATTCATTAC GAATTAAACC CTTTAATTGT TTGCCAGAAC ACATCGCTAT 1800

GGATGAATTT AAAAGCGTTA AAAATGTAAC TGGATCAATG AGTTTCATTT TTATAGATAA 1860

TOATACTCAT GATGTTATAG ATATTTTAGA AAATAGAACT ACAAGATTCT TGCGTGCCTA 1920 

TTTCGAGCGA TTCGATTTAA AAAATCGACA ACAAGTTAAG ACGGTTACTA TTGACATGTA 1980 

TGAACCCTAT GTCCGATTAT TTCGCGACCT ATTTCCTAAT GCAGCTATTA TTTTTGACAG 2040

ATTCCATATC GTTCAACATT TAAATAGAGA ACTTAATAAG TATCGTGTAC AAGTTATGAA 2100 

TGAATACCGT AATAAAAAAG GACCTGATTA TACAAXTTTT AAGAATAACT GGAAAGTCCT 2160 

ATTGATGGAT ACTAGTAAAA CCATATTTAG TAAATAGAGA TGGAATAAAT CTTTTAAGGC 2220 

TTATAAACGC TCATCTGACA TTGTAGAATT CATGCTTTCA AAAGACGATA TACTACGACA 2280 

CTCCTACGAA CTTGTCCAAG GATTACGAAA AGACCTAAGG TTATGTAATT GGCCTAAATT 2340 

TATTAATCGT TTAAATTCAG TTAGTAAAAA GTCTGTGAGT AAGGGTGTAT GGAAAGTGGT 2400 

TAAATATTAT AGAAAACATC AAAGGATGTT AAGAAATACA ATTTATTACC CAGCATTTAA 2460

TAATGGTGCT ATAGAAGGAA TTAATAATAA GATAAAATTA ATCAAGTGAA TTTCTTTTGG 2520 

TTACAGAAAT TTCAACAACT TTAAAGCACG TATAATGATG aTTTTCAGCT TGTACAAAGG 2580 

AGaAAAAAAG AAGACAACCA AGCCCAATAA TGGACTGGCC GCCTAATAwA nGGGskCTAA 2640

AAGTTgTATT TTTAAAAATA GTTCCTTTAA ATTATATACC CnCCACATTT GGGGGAGGAC 2700

CTAAAAAAGC 2710

(2) INFORMATION FOR SEQ ID NO: 422:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1027 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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20 



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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422: 

CATTTTAATT GtTaAAATTC CAAAAAAtGT ArGTGGATTw AAAGrAAACC CtGTGTTTTT 60 

AAAAGGtACC ATTaAAATAg TTCCGATTGT ACCATCCCAC GTGAAAtTTT TAgTAtTGCC 120 

GGTGaGAGAA AATGCCAATG CAATCATCGC AGTTCATAAT CATCCATCCG GTGATGTAAC 180 

GCCCTCACAA GAAGATATCA TAACAACAAT GAGGTTGAAG GAGTGTGGTT TGATTTTAGG 240 
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10 



GGGTTACTTT GATGAAAATG ATTOAAGTTQ ATAGAATTAA TGACGTATCT TGTGTATAAT 360

ACCTACGAAG TACTTTCATT GGAGGAAAAA TAGTGACTCT ATTTATTATT ATCGGGGTTC 420

TCGTGCCAAT GGTTTATACC ATGCAGTTAA nliii 1/uwtnn iyiviLL iVji C* ACAAAGCGCA 480

ATCTTTTAAT AACATTAGCT TTATCTACGT T ft ft *PT^MT 540

TAATCGTTAC GAAACAAGCT TTTCCTTTAT t*» ivr'nv^'Pft r^r* XAAb 1\7 A AljL. AATTGGCTCA AX XXX inUlVJ 600

GAATCGTTTG GGGCCTTTTA CTAAGTGGTA QCTACGCGCT GATACGATTT TTATCTAACG 660

CATTTGGGCG TAAGTAATAG TAATCTGTTC ATCAAGTAGT ATCCGTGCTT GAAAACAAAC 720

TAAAACTCCT AATGTGGAAC TAGTTAAAAA ATTCTAAACT AGTACATTAG GAGTTTTGTT 780

ATGCAGAATA AATTTAATTG TTAAATTGAA AGTGCGGTAA AAATCCACTA TTTATTTGAA 840

AAAAATCGAG CAACCAAATT AAACCTTGAT ATCCTAAGTA AATACATAAT AAAACAAGTC 900

CGACATAAAT TAAAAATCGC AAAATAGAAA GTCCAACTCT AAAAAGGAGG ATGACTAATA 960

GTGCTATTAA AATAATTGTt AGTATACTCA ATGCTACAAA CCTCCTArTA CGCTTTTAAA 1020

TCCATAA 1027



(2) INFORMATION FOR SEQ ID NO: 423: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 569 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423:

CGGCACATTT TTAATTTATA CATATTTTAA AACTAAGTAA CAGTTTGAAG AAATCGTAGT 60

TCAATAATGT TAATTGTGAA AATGTATATA AACATAAAAA AATCATGTAT AATATATGTT 120

GTTAATTAAA CAGTTCGAAA GCGAGATGAC ATTATGGGAC GTAAATGGAA TAACATTAAA 180

GAAAAAAAGG CCCAAAAAGA TAAAAACACA AGTAGAATAT ATGCGAAATT TGGTAAGGAG 240

ATTTATGTTG CAGCAAAATC TGGTGAACCC AATCCAGAAT CTAACCAAGC TTTAAGGTTG 300

GTGCTTGAAC GCGCTAAGAC ATATTCAGTG CCGAATCATA TTATTGAAAA AGCAATAGAT 360

AAAGCTAAGG GTGCTGGAGA CGAAAACTTT GATCACCTAA GATATGAAGG ATTTGGCCCA 420

AGCGGATCAA TGCTAATTGT TGATGCGTTA ACAAATAATG TAAATCGTAC TGCCTCTGAT 480

GTGCGAGCTG CTTTTGGTAA AAACGGCGGT AATATGGGTG TATCTGGATC AGTTGCTTAT 540

ATGTTGTCAT GTGGCCACAT TTGGTATTG 569
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3264 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424: 



TGGACCTATA AAATGGATGC ATCCAAATAT GGATGnGGGG TAnGGCGGGa AAATAGGaGG 60

TTATATAGTA AGTCATTTAA GTCGAATAGC CATCCTTTTT AAAATGAAAA AAATAGAAAG 120

CTCAATAGTT TGTTAAAGCC TTTAAATAGC GTCGTTACGC AATTTTAGAA TGCTAAAAAT 180

TGTCACAAAT TTCAAATTAC GTGCTAAAGC TTGAGATATC AATATTTATT GGCGATAGAG 240

TGTAATTTGA CTCCGCTTAC ATCTCAGTTT TATGTTTGTA ATTTGGTAGC ATAATATTAT 300

AATAAAATAA AATTGTTAAT CTTTAATTTC AGTATAGATA TTTTTACGTG TAGTCACGTG 360

TAAAATAAAT TCAATTAGGT TAGGAGACAT AATTATGAAA ACATTTGGTA AAAAGGTTGT 420

ATTAATCGGA GATGGATCTG TAGGATCAAG CTATGCCTTT GCAATGGTTA CGCAAGGTGT 480

TGCTGATGAA TTTGTAATTA TTGACATTGC AAAAGACAAA GTAAAAGCAG ATGTTCAAGA 540

TTTAAACCAT GGTACAGTCC ACAGTCCTTC ACCAGTTGAT GTGAAAGCAG GTGAATACGA 600

aGaCTGTAAA GATGCAGATT TAGTTGTTAT TACAGCTGGT GCACCTCAAA AGCCAGGTGA 660

AACACGTTTA CAATTAGTTG AAAAAAATAC TAAGATTATG AAGAGCATCG TTAAGAGTGT 720

TATGGATAGT GGCTTTGATG GATATTTCTT AATCGCGGCA AACCCTGTAG ACATTTTAAC 780

AAGATTTGTA AAAGAATATA CTGGATTACC AGCAGAGCGT GTTATCGGTT CAGGTACTGT 840

ATTGGACAGT GCACGTTTAC AATATTTAAT TAGCCAAGAA 900

TGTTGACGCT AGTATTATTG GCGAGCATGG TGATACTGAA CTTGCAGTTT GGTCACAAGC 960

AAATGTAGCA GGTATTTCAG TATATGACAC ATTAAAAGAA CAAACTGGTA GCGAAgcTAA 1020

AGCGGAAGAA ATTTATGTGA ATACACGTGA CGCTGCTTAT GAAATTATCC AAGCTAAAGG 1080

GTCAACATAC TATGGTATTG CATTAGCATT GATGCGCATT TCAAAAGCCA TTTTAAATAA 1140

TGAAAATAAT GTCTTAAATG TTTCTATACA ATTAGATGGT CAATATGGTG GTCACAAAGG 1200

CGTTTACCTA GGTGTACCAA CATTAGTTAA CCAACATGGC GCAGTTAAAA TTTATGAAAT 1260

GCCATTAAGT GCCGAAGAAC AAGCGTTGTT CGATAAATCT GTTAAAACAT TAGAAGATAC 1320

ATTTGATTCA ATTAAATATT TATTAGAAGA CTAAGCCTAT TTTAAGTATT AATTAGAAAT 1380

ATATTAATGG TAAGAGGATC TATTAGTGTT GCAGTAACAC GTGGCACTGA TAGGTCCATT 1440



EP0 786 519 A2 






TATGACTCGT GAAAATTTAG TGACATGAAT AAAAATGTTG AGGCQTCATT GAGTATAAAG 1560 

GAAAGTAGTT CTGCATTAAT CACGAaGTAG AGCATGACmA CGAGGAATAA CTATAGGGaG 1620 

ATGGTTTTGG AATGACGAtG TCTTGTATCA ACATGGTACA TTAGGTACGT TAATGGCTGG 1680 

CTTACTAGAA GGCACAGCTA CAATTAATGA ATTGTTAGAA CATGGGAATT TAGGGATTGC 1740 

AACGTTAACA GGGTCTGATG GCGAAtAATA TTTTTAGACG GAAAGGCATA TCATGCTAAC 1800 

GAGCATAAAG AATTTATAGA ATTAAAAGGC GATGAGAAAG TACCGTATGC ATCGATTACT 1860

AATTTTAAAG CGAGTAAGAC ATTTCCATTG CAACAATTAT CACAAGATGA TGTATTTGCA 1920 

CAAATTAAAA ATGAAATGTT AAGTGAGAAT TTATTTTCGG CTGTTAAAAT TTATGGCACA 1980 

TTTAAACATA TGCATGTACG AATGATGCCT GCTCAGCAAC CGCCATATAC ACGTTTGATT 2040 

GATTCAGCAC GCAGACAACC TGAGGAAAAA AGACAAGATA TTCGTGGTGC CATTGTTGGA 2100 

TTTTTTACAC CAGAATTATT TCATGGCGTA GGGTCTGCTG GTTTTCATAT ACATTTTGCG 2160 

GATGATGAAA GAGCTTATGG TGGACATGTT CTTGACTTTG AAGTGGATGA CGTTGTCGTT 2220

GAGATACAAA ACTTTGAAAC ATTCCAACAA CATTTCCCGG TAAATAACGA GACGTTTGTT 2280

AAAGCGAAAA TAGACTATAA AGATGTGGCA GAAGAAATTA GAGAAGCTGA ATAGTCTAAT 2340

ATAATTAAAA GACCTTAGCG ATATTAGGAA CAGGTGGTTC TAAATGCATC GTTAAGGTCT 2400

TTTTATATTA TATATGTGCT TACATATTTT GTTGATACGC GCGTAAAAAG CTATAAATAG 2460

TATCAAGATA TGGGGTGTCA AGAATGTGTT CACGTGCACG TCGATAAATA AAGCCTTGAA 2520

TCGCTTCGAC TTCAAGAGGT TGCTGATGCG CAATGTCGTA ATACATGCTC GTTCCCATTT 2580

CGTCGGGATA ACCTTGATAT ATAGTCATAA TAGTATCGAC AGTTTGTTCT GAAAAGTTTA 2640

ACCCCTCTGC CTGTGCAACA CGACAACCAT CTAGTAATAG CTGTCTACAC AGTGTACGTA 2700

TTTCAGGATT ATGCATGATT GCAACGGTCT GTCTACCGAG TGCTGTGATA GAATTAATGC 2760

CCAAGTTAAC TAACAATTTA TACCAAATAG CTTGTTGAAT ATTTGCTTCT AATACAATGT 2820

CGATTTGACT GTCTTGGACC AAATCTCTAA ATTGTCGAGT TAATGCATTA TCTTGTATAC 2880

GTAATTGATA ATCTCTGAAG TGCGTAACAA CATCGCCTTT CTTTTGACCA CTTATATAGA 2940

CAACTGCTTG GCATACGTTT TTAAATGAAA TATGTTCGAG TTGACCATAA CCATTTTGGG 3000

CTAAAATGAT GAGCGTGTCC TCATGTGCAA GATGAGTTAA ATGAGGAATA ACTGCATCAA 3060

GTTGATGTGT TTTGACTGCA ATAATAATAA CATCAAATGT GTTTGTGACA TCTTCATAAC 3120

CTTTCACAAC AATATCTTGT GCAGGTGCAT GTGGTACAGT ATAATATGTT ATTGTTTTGG 3180

CGTGTCTCCC GATAAGCGTT GTATGAGGCA ATGATTGTTG TAATTCATAG GCAATAGTTG 3240
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(2) INFORMATION FOR SEQ ID NO: 425: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425:

AGAACGAACC TTTAATACGT ACGTGTCGAC CACGAACTGT CATGACAAAT TGATAAAACG 60 

AATAGTTTTT CATTAGTTCA TTGTCACATC AATCACTTTT GTkTCACCTT TAATCACAGC 120 

ATTTTCATCA TAAATATTAA TTGAAGCTGC TTGATCAGTG TTAGTAATTA TAATTGGTGA 180

AATTACAGAT TTAGCGTTAT TATTAATATA TTCAAGGTTG AATCTTACTA ATGGATCTCC 240

GACGTTAACT TCGTCACCAC TAGACACTAA CACTTCAAAT CCTTCACCGT CTAATTGAAC 300 

TGTGTCTAAA CCGATATGAA CTAATAATTC TAATCCGTTA TCTGCTTTTA ACCCAATCGC 360

ATGCTTAGTT GGAAAGACAT TGTCAACACG TCCTGCAATT GGAGACACAA CTTCTCCTTC 420 

AGTTCGATTA ATACCAAAAC CTTCGCCCAT CATTTTTTGT GCGAATACAG GATCTGGAAT 480

ATCTTCAATT TTCACGAATT CTCCAGTTAA TGGTGCATAA ATTGCGATAT CTTTCTGAAC 540 

TTCTTTGCCT TTTCCGAATA ATTTTTTAAA CATACTTTCC ACTCCTACTT ATCAAAATGT 600 

GATATTAAAT CGCCATAACC CAATTCTTCT AACTTTTCAT ATGGAATAAA TTGAATTGCA 660 

GCGGAATTGA TACAGTATCT TAAGCCGCCA CTTTCTTTAG GTCCATCATT AAAGACATGT 720

CCTAAATGAC TATTTGATTC TTCTGAACGC ACTTCAGTTC TCAACATACC AAATGATTTG 780

TCGACTAATT CTATAA 796
(2) INFORMATION FOR SEQ ID NO: 426: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3393 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426:

ATCCAGCAAA TCTTTCTTAT CACGTTCTGT AATTTTCTTA GTATCCATCT TGATTAGCTT 60 

TGATAACTTT TCAGCCGTAT CCAACATTTC CGATTGTGTT GTTTTTCGAC CCCTAGTATA 120 

TGTAATAGCC ATTTTAGAAG CATTATCAAC TAAAACTTTC CCATTTCTGT CTAAAATACG 180 
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ATGTGAGCCT TGTGCGATTT GTAAATAACC 
TAAACGTAGT ACTAGTACTG CAAAAATAAA 300

TACAATCACA CCAAATATAA AQTTAATTCT CTTGTTAATT GTATTTTGAA CGATTTCATC 360

ATTTGATTTT TCTTTTAGTC TTTTTAACAA AACTACCTAC CTCTATTCAA AGTCTTTCAC 420

TTTAAATCAT ATATGAATTT AOAAATTATT TCTATCTTTT TGACAAAAAA ATAACGGTCT 480

CATTtAAGAG ACCGAACAAG TAATCATACT TTATTTTGTT GCATTATATA ATTCGTCAAC 540

TTTTTCCCAG TTAACTACAT TCCAAAATGC GCCAATGTAG TCAGGGCGTT TGTTTTGATA 600

TTTTAGGTAA TAAGCGTGTT CCCATACGTC TAAACCTAAA ATAGGTGTTT TACCCTCAGT 660

TAATGGATTA TCTTGGTTTG GTGTAGTCAC AATTTCTAAC TGGCCATTGT TTACGACTAA 720

CCAAGCCCAA CCTGAACCAA AGCGTGCAGC TGCTTTGTCA GCAAATTCTT TTTTAAATTC 780

TTCTAAAGAA CCCCATTGTT CTTTAATTTT TTCTACTACA GTACCTTTTT CTTCTGAGTT 840

TGGTGAAAGT AACTCCCAGA ATAATGAATG GTTTAAATGT CCACCGCCAT TATTACGTAC 900

AGCAGTTTGG ATGTTAGCTG GTACACTGTC TAAATTAGCA ACAATTTCTT CAATAGATTT 960

AGATTCTAAA TCTGTACCTT CTACTGCAGC ATTTAATTTC GTAACATACG TGTTATGATG 1020

TCTGTCATGG TGAATTTCCA TAGTTTCTTT GTCAAAATGT GGTTCTAATG CATCAAATGC 1080

GTATGGTAAT TTTGGTAATT CAAAAGCCAT AAATAATCAT CCTCCTAAAA TGTCTGTAAG 1140

TAAATAATAA CAAGCAGTGA CTGGTTCAAC AAAGAATTTG CTTAAATTCT ACTACTTATT 1200

ATTTTCTCTA CTCATTTAAT ATAACTCAAA TCAAAAATAA TTAAACATTT TGTATATAAA 1260

AAGTTAACAG ATTTGCCATA AAATCATACG AACGGAGTAT GAAATGAACC TTTATCTTCT 1320

ATAATTTAAA AAATGAGATT TATGCATACA TCGGACCAAA TGTGCATAAA TCTCATTTCT 1380

TTATATTAAT CTTGGcAAGA CTCACATACA CCATAAACTT CAAGTTTGTG TTTGTGAATA 1440

TTAACACCAG GTAGTGATAA TTTTATCTGA TCTATTGGAC AATAATCTAT TACCTTTGTA 1500

TCTCCACACT TTTCACAGAT AAAATGATGA TGATGATGGT TTGTACAAGC GATTCTAAAC 1560

TTCATTTCAC CATCAAGTTC TGTATTTTCA ATAATTCCTA AATCTTTAAA TAAGTGCAGG 1620

TTTCTATATA TTGTGTCGAA TGAAATTCCA fiftlTBlTTTT X 1 1 i TTGTTGTATA 1680

TACTTTGCGT TTATATACTT ATCTTCTTCG ACAAAAATAT CTAACATATC TTTACGTTTA 1740

TCTGTATATT TTAAACCGTT CTCTTTTAAA ATTTTAATAG CATCATTTGT ATTCATTGAT 1800

ATTAGCTCCC TTTTTAAACT TCATTCGCAT TTTCTGATAA GCCATTGTAA TCATAAGTAA 1860

AATAACAAGT AGAACTACAA TTACACCACC CGGAGAAATG TCCATATAGA AAGCTAGGAC 1920

TAAGCCTAAT ATTACTGATA ATTCACCTAA AAATACACTT AGTAATATCA ATTGCTTAAA 1980
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25 



TATCCCTACA ACACGCATTG AGGCAGAAAT AACCATCGCT ACAATAACAA TAAATAAAAA 2100

TTGAATCCAT TTAGGAATGC CAATGACTTT A i 1 "I^Tli TTOP A *A X A. X TCATCAAATG ACAATATAAA 2160

TAATTCTTTA TAAAACAATG TAATAAACAG AAC!AAeTA'TY; ATGG£*AA*1Y*A CAATAATCGT 2220

TGTTAAATCA CTTATATTCA CTGCGCTTAT TY3Af3PPAAAT anpAATrra a CAATTTCTTG 2280

ATTGAACCCA TCAGCTAATG AAATGAAGAT bUin 1 AvWilJ CACTCATTAT 2340

AATTGGAATA GCAATTTCTT GGTAAGCAGT GTATGACGTT CTTAATTTTT CAATTAGAAG 2400

CGCACCTACT ATTGCGAATA AGATTCCAAA CCACATTGGA TTAATAAATA CTAGTGTTGG 2460

CATAATAGTA AGTAAAAACA TACCGAAAGA TATACCACCT AAAGTTACAT GACTTAGAGC 2520

ATCAGCTwTA AGTGATAGTC GTCTAACAAc GATAAAAGCA CCGaTTAGAG GCGCAATAAA 2580

ACCTAtCAAG ATAcCACTAA TTAAAGAGTA CCTCATAAAA TCAAAATTCA ATAATGCATC 2640

TATCAATTGT GACACGCCTT TCCATTTTAA ATAAACTCAA ATCTTTATTA ATTACAACAT 2700

TCTCGATTAT GCTGATGATC GACAAAACGT ACAGGATGTC CATAAATTTT TGAAATTTCA 2760

ACTTCATCAA GTGATTTAAA CTCATCAGTT GTACCATGGA AATGCAAATG CTTATTTAAA 2820

CATGCTACTT CAGTAGCAGT ATCTGCTACA ACACCGATAT CATGAGTAAC TAAGATAATG 2880

GTGATACCTT CTTGTTTTAA TTGATCTAAA GTATTATAAA ATTCACTTAC ATGTTTTGCA 2940

TCAATACCAT TCGTTGGTTC ATCAAGTACT AATACTGcAG GTTCTGAAAT CAATGCTCGA 3000

GCAATCATTa CACGTTGTTG TTGACCACCT GATAATTCTG CTATATTTTT ATGAATTAAA 3060

TCACTTATAT TCAGTCTTTC TAGTACTTTA ATCACTTTTT CATTATCTTT GCTATTAAAT 3120

GTTTGGAAAA GACGTTTTGT CTTTGTTAAT CCGCTTAAAA CAACTTCTTT AACACTTGCT 3180

GGGAAACCTG AATTAAAGGC ATTTGCTTTT TGTGATACAT AGCTTAATTT AATTGATGTT 3240

TcTTATTTTT AAAATCAATA CCTTCAACAA AAATCTCACC ACTTTGTAAA GGTAATAACC 3300

CTAGAATCAA CTTCAATAAT GkTGATTTAC CAGCACCATT TGGkCCAACA AwTGCtAAAA 3360

ATTCACCTTT ATTTATTTTG AATGnnnTAT ATT 3393



(2) INFORMATION FOR SEQ ID NO: 427: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1123 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427:
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CCCTGTTTCT TAGCCTTTTC GTATATGCGT TCATAAGAAG AATTCTTTTT GATTTCAGAT 120 

TGTAAAGCAC TGTTTTCACT TGATTGTTTA TCTATTTTAT AATCTAAATC TGCAATCTTT 180 

CCTCGCGTAT CATACGCATC CATTTTTAAA GATAGCATAT AAATACTTAA CATAGCAATT 240

ACAGTAATCA AAGTTATGTA TAAAACTTTT TCAAATTTAG TTAATTGTAC AACCACTTTT 300

CTCGAAACAG TCTTCTTTTC GGGCTTAGTT TGTGGTTGTT GCTTCGGTAT ACTATTATAA 360

ACTTGTTCGT CATATGGTTG GTACACTTTT TCTACAGCCA TTATAAATTG CTCCTTATTT 420

AAGTATTTCA GCTACACGTA ATTTCGCGCT TCGTGCTCTG TTATTGTCAT CTAAATCTTC 480

TTCTGTAGCG GTAATCGGTT TACGATTAAC ACGCTTTAAC TTAGGTGTAT ATGCTTCTGG 540

TATAACTGGT AATCCTCTTG GTACCTCTGG ACCTTTTTCA TATTCTTGGA ACACCTGTTT 600

ACATAAACGA TCTTCTAAAG AATGGAAAGT GATTACCGAA ATCCTGCCAT CTACTTTCAC 660

TAATTCAATC GCTTGTTCTA TTGAATCTTC AAAAGCTGAC AATTCATCGT TTACTGCAAT 720

TOGTAGTGCT TGAAATACTC GTTTTGCAGG ATGTCCGCCT TTTCTTCTTG CTTTTGCAGG 780

AATACCTTCT TTTATAATGT CAACTAATTC TAATGTTGTT GTTATTGGTT GTTGTTCGCG 840

ATGTGCTTCG ATTCTTCGAG CTATCTGTTT TGAAAATTTC TCCTCGCCAT AGCGATAAAA 900

AATCTTCACT AACGCTTCAT ATGACCAATT GTTAACAATT TCATATGCTG TTAGTTCTTG 960

TGTTTGGTCC ATACGCATGT CTAATGTTGC GTCATGGTGA TAACTGAATC CTCGTTCTGG 1020

AATGTCGAGT TGTGGGCTTG AAACACCCAA GTCGTAATAA ATTCCATCTA CTTTTTCAAT 1080

GTTTAAGTCT TTTAATATTT GAGTTAATTC ACGGAAGTTG CTA 1123
(2) INFORMATION FOR SEQ ID NO: 428: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 838 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428: 
TATATCCTCC AnACCATAAA AATGGAATTA TAGTGAAGAC CGCTATGATC CAGGATGACC 60 

ACGTATTATA GAATCTTTCg ATAGAGAAAC GAATCaACGT ATCCgACATC aCGAAATGAA 120

ATTAGAAGAT TATAAAGATG AGTTAAGAAG AGAATATCTA AAACAATCTG ACAGAATTGA 180

AGGAGATGAA TAAGCGTGGG ACTTGATTTT AGTGGTTTAC CAGATTTAGC AGTATTGGAA 240

CAAATGAAGG AAAAAGAACA GATTAGTGAG GTTATTGCGC CTGAACATGT TCGTATGCAT 300
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CATTTCAAAA AATTTGAAGA TGATTTTAAA AATGCGGCAC AAGGGGCTTG GGTGAAAAAT 420 

6CCACAGACG AATTAAAAGA TATTAGTAAT GATTTAGAAA AAATTCAAGA TATTAAAGTA 480 

TAAAAAGGTA TTAAGAAACA CAATAAGTAT ATAATCCATT TAATAATAAA TGAATTATAT 540 

AGTTCATAAT TTCGACTATA AGTGGCTATT AGCCATTATT TTTCGGGATC TATGTCAAAT 600 

CGGACTAATG AATTCAATAA TGGAAGTTAA GCAACCAATC TTTGTTTAAC TTCTTTTTTA 660 

TTTTTTGGAA AATAAAGTTT TGAACATAAT AAAATTTGAT TATGTTTTAA CGAATTTTGA 720 

TGTTTCTTGA ACTATATATC ATCTAGTCGT CATTTACAAT TGGTAAATAT GACTTCAAAC 780 

TGkATGAAGG TGATGGCGAT TAAAAGGCTC ATCCGTAGGT TCTAAAGAAC TAGAnTTT 838 
(2) INFORMATION FOR SEQ ID NO: 429:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1150 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429:

nGTGTTCAAA TAATAGAATG GCTAATACCC CAACAATAAA TAGGAATAAA TATGTTGATG 60 

TCGTAATTTT AGTAAATAAT GAAAATCTAA AATTAGTAAC CCTATTTTGA ATATAAGCTT 120 

TAACTTCTAA TAATACTGGG AAGCCGATTG ATCCTAAAAT TATTAAAAAT ATAACTATCG 180

CTTGTACAAA ATAATCATGT GCATAAGGAA TTAATGACTT ACCTGTAATA TCTAATCCAC 240 

CATTGGTAGT GGCAGAAATA GAAACAAAAA CACCTTGCAT GATTGCATAT TTTAAATCTG 300 

GATTATCTCG ATAAAAGTAA AATGCTAATA ACATGGCTCC TACGAGTTCG ATTACAAATA 360 

TTGATTTTAC AATATCAATA ATCAATTTGA CGGTACCACT CATTGTGTTT TTGTTATTAT 420 

CTAACATAAT TAACTGACGT TCTCTAATTC CAATATGTTT ACCTAGTACC ACCCATAACA 480

TCGTACCAAT TGCCATGACG CCAATTCCAC CAATATTTAA TATCACGAGG ATAATTAATT 540 

GTCCAAATGT AGAATAGGTA TCGACAATAC TTATCGGAGA CAATCCAGTA ACACTAATTC 600 

CGGATACGGC AACAAATAAT GTGTCAATTG GATTTACTTC TACACCTGGT TTATGaACAT 660

ACGGTAAGTT TAATAATAAA AATGCAATGA CAATTGCGAC GATATAGTAC AATACAATAC 720 

CTTGTTGAGG GCTTGATCTT TTTAAAAACT GGCTAAAAAT TGACACGTTG TTCACCTCAA 780 


CATTATTTTA GTTTAATATT AATTTCTTTC TCTTTACCAT CTCTATAAAT CTTCGCTGTA 840 

ATTGATTTCA AGTCATCTTT ATGACTAAAT ATAATCTGCC TAAAGCGTAA ATCATCTTCT 900 
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AAACCATTGT TOTCAACTTG ATCTACAACA ACACCGTTCT TAACTTTTCC TGGCAATTTA 



1020 



ACTGCTTGTC TTTCAAAACT ATTTAGACTG ACAATATTCT TCATTTTAAC ACCTACATCG 1080

GGATAGTCAA TTTTACCTTT tGTTTCTAAA TCTTTTACAA TCTTTtGTAC TTCATTAaCA 1140

GGtATTGCAA 1150



(2) INFORMATION FOR SEQ ID NO: 430: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430:

AAGAATATTT AGGATTTTAT GTTTCGCAAC ACCCAGTAGA TAAAAAGTTT GTTGCCAAAC 60 

AATATTTAAC GATATTTAAA TTGAGTAACG CGCAGAATTA TAAACCTATA TTAGTACAGT 120 

TTGATAAAGT TAAACAAATT CGAACTAAAA ATGGTCAAAA TATGGCATTC GTCACATTAA 180 

ATGATGGCAT TGAAACTTTA GATGGTGTGA TTTTCCCTAA TCAGTTTAAA AAGTACGAAG 240

AGTTGTTATC ACATAATGAC TTGTTTATAG TTAGCGGGAA ATTTGACCAT AGAAAGCAAC 300 

AACGTCAACT AATTATAAAT GAGATTCAGA CATTAGCCAC TTTTGAAGAA CAAAAATTAG 360 

CATTTGCCAA ACAAATTATA ATTAGAAATA AATCACAAAT AGATATGTTT GAAGAGATGA 420

TTAAAGCTAC GAAAGAGAAT GCTAATGATG TTGTGTTATC CTTTTATGAT GAAACGATTA 480

AACAAATGAC TACTTTAGGC TATATTAATC AAAAAGATAG TATGTTTAAT AATTTTATAC 540 

AATCCTTTAA CCCTAGTGAT ATTAGGCTTA TATAACTTTy ATCAACTAAT AAATTATGAT 600 

ATAGTAAACT GATGGTTAGA TATTTyTyAA CCATCAGTTT GCGTktATAT TAGTTTTTTA 660 

TGCTTATTAT TTTTATGAGT TTCACTTTAC ATTATTGATT AATCCATAAG AATAATTAGC 720 

ACAAAAAGCA GTATACATAA ATTGAGTAAA GAATTTTGTC GATATAACCG nGCGGAAAAA 780 

TAATAACnTT TTGGATn 797 

(2) INFORMATION FOR SEQ ID NO: 431:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1466 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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TTTCCAAGAG 


AAGTTGCGGA AGTAATTAAT AAAACGCATC ATAATAAATT GGTCATTTCG 60

ATGATTTOGk CACAAATCGA TGCGGATAGA ATGGATTATT TACAACGTGA TGCGTATTTC 120

ACAGGTGTAT CATATGGTGC TTTTGATATG GAGCGTATTT TAAGATTAAT GCGACCTTCT 180

AAAGATGAAG TACTAATCAA AGAAAGTGGT ATGCATGCAG TTGAAAACTT TATTATGAGT 240

CGTTATCAAA TGTATTGGCA AATTTACTTC CACCCAGTTA GTCGTGGTGG AGAAGTGCTG 300

CTTAATAATT GtTTGAAACG CGCAAAACAG CTTTATAATG AAGGCTATGA ATTTAAGTTG 360

CATCCACATG ATTTTATTCC ATTTTTTGAA GAGACAGTTA CGATTGAACA ATATGTTGAA 420

CTCGATGAAG CGGTAGTTAC GTATTATTTO GAAAAATGGA CAAAAGAAGA TGATGCTATT 480

TTAAGTGATT TAGCAAGTCG ATTTATTAAT CGAGACTTAT TTAAATATAT TCCATTTGAT 540

GGCTCAATTA TTACAATATC AGAACTGCAA GAACTGTTTG AAGCAGGTGG TATTAATCCA 600

GATTATTATT TTGTGAGTGA AGCATTTTCT GATTTGCCAT ATGACTATGA TCGACCGGGG 660

TCAAATCGCA AACCGATTCA TTTATTAAGA CAAGATGGTA CGATTAGAGA AATAAGCAAT 720

CAATCATTAG TCATTCATAG TATTACAGGC ATTAATCGCC AAGACTATAA ATTATATTAT 780

CCTAGAGAAA TGGTTGCAAA GATTAAAGAT AAGACAATTA GAGAAGCTAT TGAAAATTTG 840

ATTAATGAGC TTAATTAAAC AGGGCTAAAA TTGTTATCGT TAAATATGGA GGTTATATCA 900

TTGTCTGAGA AAAAAGGCTT TAATTTTAAT ATCATAAAAA ATGACCCTCT AGATGGTCAT 960

AAAGGTACAA ATATTGGTTC AATTAGCTTA GACAATATTG CACCAGTTTT TATCGATGTT 1020

GCTAACAAAG AAGGATTTAT TGATATTGGA GGCATGCATG CTCGTGCCAA AGTTGAAAAA 1080

GGTGTGAAAT GGATTACTGA TAAAGCTGCT GTTGAAGGCG ATGAAGCTAA AGaATATTGG 1140

TTGTGTTGGG TAACAACAGA ACGTAATGAA CAAGGACCAT ATTACGCTGG TTTAACAGCG 1200

TGCTATTTAT TAGTGAATAA AGCAATTCGT CGTGGTTATA AAAGTATGCC TGAACATGTT 1260

AATATGATGG ATAAATCAAT GAAACATCAT ATTATCATAG ATCAAATTGG

AAAGCTATTT TAAAAGACTT TTTAATGAAC CATGATGAAG GTATGTGGAA GCATTCTTCT 1380

GATGCTTTAC ATCAAGCATT TAATTAAATA TTAGAAACTA AAATTTCCCA ATTAATCTAT 1440

AAAGATATGA TTCATTTCTC AATGAC 1466




(2) INFORMATION FOR SEQ ID NO: 432:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2304 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432: 

TTGTACGAAT ACAAAGTTGC ATTAATATAG AATCATTTAT GCTTTTTAAT ATCACAAAAC 60 

AAATTGATTA AACTAGCTAA AATAGTCAAA ATTGGAATCA AGATTACATA AATATCATAA 120 

CCTCTAGATA TTAGTATACA TAATATAATT AGCGACCCTA TTATAATTGC TAAATTATAA 180 

GCCAAACTAA TTTTATATTT CATTTCAATT CCCACCTTAA TAGCATTTAT TAAAATTCTA 240 

TAATGTTTAC ATAGACTTCT TAACAAACTT TAACCCAGCT AAGCTAAGTA TTGAAATAAT 300 

TACACCTCCT AATATAAATA TATTACTTAA GCTTAGTAAA GGTATAATTA TACTTATCAG 360 

TCCTAAAGAC AATGTATCCG CTGCATAATT CGaTGtAGAT GAGATACTAA ATACTTTCCC 420

CATCAAATGA TTTGGCGTTT TTATTTGAAT GGCAACTGAT CTTGTTAGTC CCTCTATAGA 480 

TTGTCCAAGT CCCAATAATG TTGCACCTAT ATATAATATC GCCACACTTG GAAACACATT 540 

AATAATCGAT AAGCCAATTC CCCAAACTAA AACACCAATA CTAAATTTAA AGATTAATCG 600

CTTTTCTGAC AGCAAACCCA TAATCAATGA CATTAATAAA GATGCTATAC CTAAACATGA 660 

TGTAGCTAGT CCATATACAC CAACGCCCTC TTTTAATATA TTGGAAATAA ACAATGGTAA 720 

TACAACACGC CAAAGGCCAG TATTAATCAA TATGCAGGCA AATTGGATGA TTATAATAAA 780

TGGAATTTCT TTAGATTGTT TCAAGAATTC CCAAGTTTCA GAAAAATCTT CTTTTGAGTG 840 

TCTATCAATC ATGTTGTTAT TTGTATATTT TAAAAGTGCA TTAAAAATAA ATCCTAAAAA 900 

TAGCAATATA CTACAAATAA AAAAGACGCC AACATTACCA ACTAGTATTA CAATGACACC 960 

AATTAAAGCA GGTAAAATAA TATTTGAGCC TCTTTGCAAA CTATCGATTA ACGCATTACT 1020

TGTTGCTAAA TGCTCCTCAT CAATAATTTC AGGAAGAATT GCCCTAAACG CAGGATCCGT 1080

ATAGCAGTTA ATAATGGTGA TAGCTGTAGA TATGGTTAGA AGCGTCAGAT AATTTAAATT 1140

TGATGTTATT GCAAGTAAAG GAATAATTAT TATAATCAAA CTTAGTATAA GATCAGATAG 1200

ATAAAGTATT TTCTTTTTAT TATGTTTATC AGAATATGCG CCACCGAAAA TACCAAATAT 1260

AATAAATGGA AGTGTTTGAC TCATAACCAT CATTGATAAT TTTAAAGATG ATTGGTTTGT 1320

CAATTCAACA GTAAACCAAA TTATTTGTAA CGAAAACAGC ACAAAACAAC TCCGACGTAA 1380

GArATTACCA ATCAATAAAT ATGTAAAGTT TCTATTTTTC AAAACTTCTA AATACAACAT 1440

ATTTATCACC TCTCATAAAA ATAATTGAAT GCATCCACCA GCTTTTTTAG ACCTTCTTCT 1500

AAACTCTCTT TATCCAAAGC GCAATTAATT CTAATATAAT TTAGTCAGTT AAATATCAAT 1560 

TATTTCGAAA TATACATACT ACTTGAAACA CCATACATAA CCCCCAAAAT GACTACTCAG 1620 

AGGTTATATT CTACTAATTA TGATTATATT AAATATGAAA ATATTATCAA AAAAATCAAA 1680 

TCATTCTCAT CATATTCTAG GTTGTTTTTA ACAAACTAAA TATAGTGAAT GCAAATCAAC 1800

TATTATTTAA ATTATGAATT ATTTTAATTC TTTCTTCTAC GAGCCAATAA CATTAATCCA 1860

GCAATTCCAA TTATACTACT AAAGATCAAA CCTTTTTGCG TGCTTTCTAA ACCTGTTTTT 1920

GGTAATTCTG CTCGTTTTTT CTCTTGATTA GCTACTGATT CTTTAGCAAT TTTAGATTTT 1980

TTAACTTTAT CATTTTTATC CATTGAATGA ACTGGGCCAT TTGGTTTTGC TCTGTCTTTC 2040

GATAATCCTG GATTGTTAGG ATTTACTGGG CCACTTGGAT GAGTTGGTCT GCTCGGCTTC 2100

TCTGGGTTTT CAGGTCCTTT TGGATCTTTT GGTTTCTCTC CACCGAACTC TACAATCTTA 2160

TCTACTGGTT GTTTtGTGAT CTCTTCTGTT GGTTGACCCT CGCCAACTTT TTCACCTGTT 2220

AATGGGkTCA CTGTGAwTGG TGkTGTGAwT GTCyTACTTC CTGGkTGTCC TTCTTGTTTC 2280

ACTCGCTCTT CACCAGGTTG TAAT 2304


(2) INFORMATION FOR SEQ ID NO: 433:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4733 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433:






GTCTTAATAT TTTACAAGAT ACAGCCTGAC AATATTTATA TTACAAACCA ATATGGTAAA 60

GAGGCTGTGT CTCGATTAAA AAGAATTTTA GAGCAAAGTA TACCTAATAA AGAATATATT 120

GATTTATCAG AGATATCATT AATTACATTT TTTAAACCAG AATATAATAA ACAACATGTG 180

AATGAGGAGT TTAATTCTTT ATCTTCTTCG AAAATAAAGA ATTTAGCCAA AGAAAATGAT 240

GGTATTATTA TTTATTTAAA TTTTAAGGAT GTTAATTATC AACTGaTAaG TGAAGGCAAT 300

acittttttt CTGCAAAACC ATATTTGCAT TGTGTGTTCA ATAAAGATAT TTTAAATATG 360

GATAAAGTTG ATATAGAAAA TTTTTTCAAA AGTATAGAAG TCAAATACAG TAAATAAGAT 420

CAATTATTGA ATCCACCGAC CAGCAGATTT TTTTAAAAAA TTAATACCCC GTTAATACCC 480

TTTGCTTCAA TTTGATGAAA ATCAATGAAA TTCAAAATTG AAGAAATCCT TTAATATCAA 540

GGTGTACGAC AGTCTATATA ATCATGCGAA ATTCTAAAAT TTTCtGAcGT AAAAAAACAA 600

ATTCyTAAAG CAGCTCGTCG TTCACCTCAA TTCTCAAAAC GTTAATTGTC GGACGATATA 660

TATACAAAAC ACCTCGATGT TATGTCGAGG TGTTTTTTTG CGTTTGTGrG GGGAATATGG 720

AATAGAGTAT AAATGAATTA ATGTCTCAGG TATAGAATTA ATTCAACTAT GAATTATTGA 780
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TTAGCTTGCG GAGACCGTAC GATAGTGTTC TAATAATAGA TATTCAAGGG AAAGAGCTAT 900 

CGGATGATAA CCAGATGATA ACTCAAGAAC CATGrTTTAA GAGTGTTATC AAAATAAGGA 960 

GTGACTTAAT TGAAAGATTC TATATTTTGG AAGAAAGCTT TTATTyCTGT TTATTTTATT 1020 

GTTGsGATGC TGGTGTTTCT ACTTTTTAGQ TTTTATATTA AaACAGATAA CTTTTCTATA 1080

TATTTAATGA GTATCTTCTT AATTTGTTTA GGAACTGCTT CTATCATTTA TAACTATAAA 1140 

ACCAATCGAT AAATATAAGT ATGAAATGTA TAAGTCACAG TTAGATCTAA GTCTTGCTGT 1200 

GGCTTTTTAT ATCTCCATAA AACGACAAAT TCAAGCCCGA CATAAAACAG CATTTTCAGC 1260 

CCACCATAAA ACGACAATTT CAGCCCGCCA TTGACTAAAC ACCACATCCC AAAAATATCG 1320 

TAACAATCCT CTACATCAAT CAATCCAACA TCCCTCATAA TCACAACGCA CAAAATCTAT 1380

TCATGCATTT TTGGAATACT TAGTATTACA AATAAOGATT TTTATTCATC TAACAAAGGT 1440

TATATAATGT ACTGAAGGCA ATTTTTATGT ATTACAAATC TAATCGTACA TGTAAAATTT 1500

TGATAAACAT CATTAATTTT GCGTAACTAT CATTAGATTA CAAATCACAA AGTAATTACA 1560 

TGTAATACAC ATCTATACAT CACATTTGAA GGGAAATGaA TATAAATGAC TGATAAAAAG 1620 

TACACTGCAG CCGATATGGT TATTGATACT TTGAAAAATA ATGGGGTAGA ATATGTTTTT 1680 

GGTATTCCGG GTGCAAAGAT TGACTATCTA TTTAATGCTT TAATTGATGA TGGTCCTGAA 1740 

CTTATTGTCA CTCGTCATGA ACAAAACGCC GCAATGATGG CGCAAGGTAT TGGAAGATTA 1800 

ACAGGCAAAC CGGGTGTAGT ACTTGTTACA AGTGGTCCTG GTGTAAGTAA TTTAACTACT 1860 

GGATTATTAA CCGCAACATC TGAAGGGGAT CCTGTATTAG CGTTAGGTGG CCAAGTGAAa 1920 

CGnAnATGAT TTATTACGAT TAACGCATCA AAGTATTGAT AATGCTGCGC TATTAAAATA 1980 

TTCATCAAAA TATAGTGAAG AAGTACAAGA TCCTGAATCA TTATCAGAAG TTATGACAAA 2040 

TGCAATTCGA ATTGCTACTT CAGGAAAAAA TGGCGCAAGT TTTATTAGTA TTCCGCAAGA 2100 

CGTTATTTCT TCACCAGTTG AATCTAAAGC TATATCACTT TGCCAAAAAA CAAATTTAGG 2160

AGTACCGAGT GAACAAGATA TTAATGATGT CATTGAAGCG ATTAAAAATG CATCATTTCC 2220

TGTTTTATTA GCTGGTATGA GAAGTTCAAG TGCAGAAGAA ACAAATGCCA TTCGCAAATT 2280 

AGTTGAGCGC ACGAATTTAC CAGTTGTAGA AACATTCCAA GGTGCAGGTG TAATTAGTCG 2340 

TGAATTAGAA AATCATTTCT TCGGTCGTGT GGGCTTATTC CGCAATCAAG TTGGTGATGA 2400 

ATTATTACGT AAAAGTGATT TAGTTGTTAC AATCGGTTAT GATCCAATTG AATACGAAGC 2460 

TAGTAACTGG AATAAAGAAT TAGAAACACA AATTATCAAT ATTGACGAGT TCAAGCTGAA 2520 

ATTACTAATT ATATGCAACC GAAAAAAGAG TTGATTGGTA ATATTGCTAA AACGATTGAA 2580
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CAATTAAGAA CACATATTGA TGAAGAAACT 
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GGTATTAAAG OGACGCATGA AGAAGGAATT 2700

CTACATCCAG TGGAAATTAT TGAATCTATG CAAAAGGTAT TAACTGATGA TACTACTGTA 2760

ACAGTTGATG TTGGAAGTCA CTATATTTGG ATGGCACGTA ATTTCAGAAG TTACAATCCA 2820

AGACATTTAT TATTTAGCAA TGGTATGCAA ACGCTTGGTG TAGCATTACC GTGGGCAATT 2880

TCAGCTGCAC TTGTGCGCCC TAATACGCAA GTTGTGTCOG TTGCTGGCGA TGGTGGCTTT 2940

TTATTTTCAT CACAAGATTT AGAAACGGCC GTACGTAAAA ATTTAAATAT CATCCAGCTT 3000

ATTTGGAATG ATGGAAAATA TAACATGGTT GAATTCCAAG AAGAAATGAA ATATAAACGT 3060

TCGTCAGGTG TAGACTTCGG TCCTGTAGAT TTTGTAAAAT ATGCAGAATC ATTTGGCGCG 3120

AAAGGTTTAC GAGTTACTAA TCAAGAAGAA TTAGAAGCGG CAATTAAAGA GGGCTATGAA 3180

ACAGATGGTC CAGTATTAAT TGATATACCT GTAAATTACA AAGATAATAT CAAACTTTCA 3240

ACAAATATGT TACCTGACGT ATTTAACTAA AATAAAGATA AATGTTAAAG AGGAGTGGGA 3300

GATTTTATGA CTAATGTTTT ATACCAACAT GGTACATTAG GCACATTAAT GGCAGGATTA 3360

TTAAAAGGAA CTGCATCAAT AAATGAATTA TTGCAACATG GTGACTTAGG TATCGCTACA 3420

CTAACAGGTT CAAACGGTGA GGTAATCTTT TTAGATGGAA AAGCTTACCA TGCAAATGAA 3480

CATAAAGAAT TTGTAGAATT AAAAGGTGAC GAGTTAACAC CATATGCAAC TGTAACTAAA 3540

TTTGTAGCAG ATACAAGCTA TGAAACGAAA GATAAATCTT CAGAAGCAGT TTTTGCAGAA 3600

ATTAAGGAAA AGATGTTGAG TGAAAATTTA TTTTCAGCAG TAAAAATTTC AGGCTTATTT 3660

AAAAAAATGC ATGTACGTAT GATGCCGGCT CAAGAACCAC CTTATACACG TTTAATCGAT 3720

TCAGCTAGAA GACAACCTGA ACAAACTGAA ACGTATGTCA AAGGTTCAGT CGTTGGTTTC 3780

TTTACACCAG AATTATTCCA TGGTATCGGA TCAGCAGGAT TTCATGTACA CTTTGCGAAT 3840

GATGATCGTA ACTTTGGTGG ACATGTCTTA GATTTTGAAG TAGAAGATGT TAAAGTAGAA 3900

ATCCAAAATA TAGAAACATT TGAACAGCAT TTTCCAATTC AAGATAAAGA TTTCACTAAA 3960

GCAAATATTG ACTATAAAGA TATTGCAGAC GAAATTAGAG AAGCTGAATA ATGAATCCAG 4020

AAATATAATG ACGGTTTATG AAAATTGACT TCATAATGCG CGATTTAGAA ATGATAGTTT 4080

GTAAATAlpA TTAACCATGA CTACAATAGA ACAAATATAT TTATAATTAC GTCTAAGTAA 4140

TAAAATAAAT CCCTTCACTA TTAGCAGTAG TGAGGGGATT TATTAGGTTC CAGATATTTG 4200

AGATTTGCTG TTATGTTTAG ATTATAAATT GTGGTACACA CTCATATAAA ATTTACTATT 4260

GTATAGGCCA ATCTGTTACT ACGAGAAGCA AACAACAATA ATTTACAAGT TCAATAACTA 4320

AAAAGACAAA CGCCAATTTT TCAGCGCTTG CCCTATAAAA CTATTTTCAA ATTATTATTT 4380
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TTTCTTTAGA TTCACTACTT TTTTATTACC ATCATTCAAA GTAAGCGTAT AAGTTGCTGT 4500 

TTGGGCATTA TTAATTTTTT CTGTTGTAAC ACCACGTTGA GAAGCTAATT CATTTTTTAC 4560 

TTTACTOTCA ATTTCTTGAT AAAGAACATT TTTATTTTCT GGGAAGATAA AGTAAGTTCG 4620 

ATGTAATGCA GTAATACCAT CTACTGAAAT TGTGTAAGGA ACAGTGTGAT AACCATCCAC 4680 

AGTCATTTCT TTATAGCCGT TATTACTATC TGCAGATGCT TOGTGACTCG GTA 4733




(2) INFORMATION FOR SEQ ID NO: 434: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1284 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434:



2S 



30 



40 



45 



AATAAATGGT AATTTTATAT CATCAACGGC TACAAATGGG GCAGTTTCAT ATATAAATAA 60

ACACATCTAC TCAACTGnTC TTAATTAATT AATAAATGAT TTACCAGATG ACATATAACA 120

GCCATTATTA GnTTAATTCT AATAGTTTAT TTAAATTTTC TTCGGTTGTC GCCCAACTGG 180

TTGCGAATCT AACAACACGA TGTTGATCAT CGTATTTTTC CCAAACAGCA AATTTAACTT 240

TTTGTTCTAA CTCTGCTATT TTCTCGTTAC TTAAAATAAA AAATTGTTGA TTGGTTGGAG 300

AATCAAAGTA AAGACGATAG CCTTTATTTT TAAACCCGTC TTTCATCTTA TTTGCCATTT 360

CGATAGCATG TCTGCTTATA TTAAAATATA AATTGTCCGT AAATAATTCT AAAAATTGTA 420

TGCCTGTTAA CCGTCCTTTT GCTAAAAGGG CACCGTGGAT GCTTGATTCG AGTGGTAAAT 480

TGTTTCGGTT CATTATTTTT CGTAAAAACA ATGGCTTCCC CGCATAATGC ACCTATCTkC 540

GTACCACCTA TATAAAATAC ATCACAATAT TTAGCGrTGt CTTTAATAGT CATATCTGAT 600

TGGTCACTCA TCAATCCATA CCCTAATCGT GCACCATCCA TAAATAATGG AAGCTGATAT 660

TGCTTACATA CTTTGCATAA CTCTTCCAAT TCTGaTTTAG AGTATAATGT GCCATATTCT 720

GTAGGATGAG AAATATATAC CATTCCTGGG AATACCATAT GGTCCTTTTT AAAATCACTT 780

TTAAATGTCT CCATGTAAGT TTCAACATCT GAAGCACTAA CTTTTCCTTC CTTAGAGGGT 840

ATAGTAATTA CTTTATGTCC ACTATATTCA ATTGCACCGC CCTCATGCAC AGCAACATGA 900

CCAGTGTCTG CTGAAATGAC CCCTTCGTAA CTTTCTAACA TTGAATTAAT AACAACCTGA 960

TTGGTTTGTG TTCCACCTaC TAAAAAACGA ATTGTAGCAT TTGGaCAGTC AATTGTATCT 1020

TTAATCTTTT CAATTGCCTG AGCTGTGAAT TGaTCAAAGC CATATCCCGA AGCTTGTACA 1080

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TCGAATGAAA TCACTACATT TCCCCCTAAA ACTAATATCA ACATTTTAAT AAGATAAACC 1200 

AATTTCAAAA CTAGTTOGAT ATTTAAAATG TATTATGGAT GGnTAAAGTT TGTATCGCAT 1260 

TATCGOGAAG TTGnATAAAT ATAT 1284

(2) INFORMATION FOR SEQ ID NO: 435: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1072 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435:

ACAGCTTTTG GTAArGGAGA AAwTCAaTAt GAAACAGTAT AArGCGTATT TaATCGATTT 60

AGATGGCACA ATGTATATGG GAACAGATGA GATTGATGGA GCAAnACAAT TCATCGATTA 120

TTTAAATGTA AAAGGCATTC CTCATTTATA CGTAACTAAT AATTCAACAA AAACACCTGA 180 

GCAAGTAACT GAAAAATTAC GTGAAATGCA CATTGATGCT AAACCAGAAG AGGTTGTAAC 240 

GTCAGCGTTA GCCACTGCTG ATTATATTTC AGAACAATCA CCAGGAGCAT CAGTATATAT 300 

GTTAGGTGGG AGTGGTTTAA ATACTGCGTT AACCGAAGCG GGACTTGTCA TTAAAAATGA 360 

CGAGCATGTT GATTATGTAG TTATTGGACT TGACGAACAA GTTACATATG AAAAGCTTGC 420 

GATTGCAACG TTAGGTGTAA GAAATGGtGC AACATTTATT TCTACAAATC CTGATGTATC 480 

AATTCCTAAA GAGCGTGGTT TATTACCTGG TAATGGTGCT ATTACAAGTG TTGTAAGTGT 540 

ATCGACAGGT GTATCGCCAC AATTTATTGG TAAACCAGAA CCGATTATTA TGGTTAAAGC 600

ATTAGAAATT TTAGGATTAG ATAAATCCGA AGTTGCTATG GTAGGCGATT TGTACGATAC 660 

CGATATTATG TCTGGTATTA ACGTAGGTAT GGATACGATT CATGTACAAA CAGGTGTATC 720 

TACGTTAGAA GATGTGCAAA ATAAAAATGT GCCACCAACG TATTCTTTTA AAGATTTAAA 780

TGAAGCAATA GCTGAATTAG AAAAATAGAT ATAGTCATTT TATAAAGTAG GTGAATTGAT 840 

TTGGTAAAAA TAGTTGTTTC GAGGAAAATT CCAGATAAAT TTTATCAACA ATTAAGTAAA 900 

CTTGGTGACG TTGTTATGTG GCAAAAATCA TTAGTGCCTA TGCCTAAAGA TCAATTTGTG 960

ACaGCcTTCG TGACGCAGAT GCTTGTTTTA TTACATTAAG TGAACAGATC GATGCAGAAA 1020 

TTTTAGCGCA ATCACCAAAT TTAAAAgTAA TTGCGAATAT GGCTGTAGGA TA 1072 
(2) INFORMATION FOR SEQ ID NO: 436: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3271 base pairs 
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is 



(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436:

TAAAAACTTA CTTTAACACC ATTCCTTTTT AACTTTTTTC GTGTTTCnTT TnCTTAAGTC 60

CATCCATATT TTTAATGATG TCATCTGCTG TTTTATCTTT TAAATCTAAC ACTGaGTGAT 120

AmCGAATTTG TAGCACAGGA TCAAATCCTT TATGGAATCC AGTATGTTCA AATCCTAAGT 180 

TACTCATTTT ATCAAAGAAC CAATCATTAC CAGCATTACC TGTAATCTOG CCATCATGAT 240 

TCAAGTATTG ATATGGTAAA TATGGATCGA TATGTAGGTA TAGACAACGA TGTTTTTTAA 300 

CATATTTTGA TAATTCATTA AAGAAAAAGT GTACGAGTTC TTGATTTTCA TAATCAATCA 360 

CTGGACCGCG ATTTGAATAA AAATACTTGA ACACTTTCAT AACAGGTACA GCAGTAAGtA 420 

AGCAAGCTGC AATGACCTCG TTATTATTGT TTTTTATTCC CACTAAATGT GTTTCATAAC 480 

CTTCAGCAAG CTTTAACTCA TAGTGGCCAA CAGTTTGCGT GaAATGACTG TATGGCATGC 540 

TATCTGTAAa GGCACCAAAC TCTTTAGCTG TTAAATTTGT AAACTTCATT ATCATTACTC 600 

CTATTTGTCT CTCGTTAATT AATTTCATTT CCGTATTTGc AGTTTTTCTA TTTCCCCTCT 660 

GCAAATGgCA AAAATAATAA ATCTAATCTA AATAAGTATA CAATAGTTAA TGTTAAAACT 720 

AAAACATAAA CGCTTTAATT GCGTATACTT TTATAGTAAT ATTTAGATTT TTGAATACAA 780

TTTCAAAAAA AGTAATATGA ACGTTTGGGT TTGCTCATAT TACTTTTTTT GAAATTGTAT 840

TCAATTTTAT AATTCACCGT TTTTCACTTT TTCAAACAGT ATTCGCCTAA TTTTTTTAAA 900

TCAAGTAAAC TTAATTATTC AATGTTTGTT GGATAGATTG TAAATATTTA ATGATTTCCT 960

CACGCGTGTT AGATTTAAAT CGCTTAACGA TTTCGCTACC AATGACAATG CCATCTGCAA 1020

CCTCTTTTAT ATCTGCAACA TGTTGTGGTG TTCTTATACC AAATCCTGCG ACAACTGGCA 1080

CATTGGCTAT CGCTTTAATT GACTCAATTT TTCGTTTTAA TTCTGGATGA AACGCACCGT 1140

TTTGCCCTGT TGTCGCATTC ATCGTCACAG TATAAATAAA GCCTTCCGCA TGGGATACGA 1200

TATCTTTTAT ACGTTTGTCA TCAGTAGTCA TCGCAACTAA CGATATGATT TTGACGCCAT 1260

AGTGACTAAA TTGTTGTTTT AAACGCTGCG ATAATTCATA TGGTAAATCA GGAATAATTA 1320

AGCCGTAGAC ACCAGTATCT CGACATTTTT CAAAAAACGC TTGTTCTCCA TAATGACAAA 1380

TAATATTATA ATACGTCATT AATACATAGT TACACTTAAT TTGATCACCA TGTTTTTCTA 1440

ATTGATTGAA AATATAATCT ATCGTGATGC CTTGTTTAAT CGCTTGTTGA CCTGCTTCCA 1500 

TGATAACTGG ACCATCAGCA ACCGGATCAG AGAAAGGTAC TCCAATTTCA ATTATATCTG 1560 
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GTATAAATAA 


TTTAGTCATT TGCAAGACCT CGCTCTACCA TATATTGTCT AATXGTTTCC 1680

ATATCTTTAT CGCCACGTCC AGAAATAGTT ACTACAATAA TATCTTCTTX CGACATCGTA 1740

GGCGCTAGTC TTTCAACATA ACTCAGTGCA TGTGCACTTT CAATTGCAGG TATAATACCT 1800

TCATGTTTTG TAAAGTTGAT TAAAGCATTC ATTGCXXGXG TATCACTAGC ATTTTCAAAA 1860

GTTACTCTAC CAATGTCGTG GTAATAAGAA XGTXCXGGTC CAATACCAGG ATAATCAAGT 1920


CCTGCTGAAA 


TAGAATGTGC 


TAGTTGCACP 


TGP Of* A TPTT 


CATCTTGAAT 


TAAATACATT 


1 Qftft 




TTAGTAC CAT 


GTAATACGCC 


AGGTGATCCT 


TTlft OO A A ■ i w i "rt 

X X \JCw\/\ X X V7 


CAAGTGCATG 


TTT AT CAGT A 


<U4 v 


IS 


TCATCGCCTT 


GACCTGCGGC 


TTCAACAOCG 


TATAATGOAA 

X Ax MAX VJV^tn 


CATCATCTTT AATAAATGGA 


•31 Aft 




TAAAATGTAC 


vj*%x x \jv— #a x x 


tcsagooapoa 


POfJ AT AO ATY2 


CTACAATTGC 


ATCCGGAAGT 






WftwV * A W X X 


TPTTP & & TAT 


vlvlWii XXX 


AX 1AL. 


CAATCACACT 


CTGAAAATCT 


O *i O f\ 

2220 


20 


otaa oa aim 




f2TPTT2P 21 PPT 


TV H»T*PPIVP71 RP 


CTAATAAATA 


ATGTGTATCA 


2280 




TOT A PA Tf2 a p 


X lALUWviln 


TTVSPMTrtP'P 
X luLAAlTivi 


TTATTAACTG 


CATCCGATAA AGTCCCTTGA 


2340 




VUil^l XWWi 


ptv2p , p i ta pn hp 


LI 1 loLALLA 


AUxAAi XaJC-A. 


TTCTAAATAC 


ATTAAGTTGT 


2400 


25 


TGTCTTTTAA 


TATCTTCACT 


T*r^/"VT\ r PTV Tv TV /-» 

TC CGATAAAG 


ACAACAAGTT 


CCATATCAAA 


TAATG CAGCA 


2460 






lAuW X AVJACL. 


AXvjX xgaccc 


G CAC CAGTTT 


CAGCAACAAG 


CTTCTTCTTG 


2520 


30 


^^^^^^ H ^T*/ ** 1 ** i w i ' 




CGCTTGACCT 


AACGCA1 xAT 


TAATTTTATG 


GGCGCCTGTA 


2580 


luAl 1 X Av*A 1 




LAAA1A1A1 X 


i |wp t\ prtpP/* 1 TV /"> 

ITAvj LTGCCAC 


CTAGGCTTTC 


AGTATATGAT 


2640 




f2 P Af2 PATATT2 


X nnu X vj^j l\9 X 


\.V7V.\j<*Aj X\^L- X 


AlJAX As- X L. X\» 


ATAAATAGTA 


TTCCAGTTCT 


2700 


35 


/ ** I u 1 11 1 ta ta P*"P 

vlli vivviv 1 




XXX lVj^.U lvl 


X iAiAAVjUl X 


TTTTCAACTC 


AATAATTGCT 


2760 




czrzc aTTii &Tr; 

.Wv»f\X XrtrtxVj 




21 T 21 TTY2PP P*p 




CAAAGAAACC 


TAATTCATCT 


2B20 




fTi L " i ~ i "i "I'OTTT 

>7w A X w A V7 X X * 


\7 im X X V7 XXX 


j^x i\_ni x ^* x 


TOT A T OTOOT 


TTCACAATAT 


TTACAATTGC 


2t5eU 


40 


TGTCATTTTT 


TCTATATCTT 


J l"i "P(5CCjC!ATT 

X 4. vvvvvnl X 


TAOTTOTATA 


CCTGATGCAA 


GATCATAACC 






TTGATGTGAT 


AATTTAAGTT 


GATTAACTGT 


TTGAATATTP 

X Xx"a«»X*»X X X 


TCAGAGTTAA 


TGCCTCCTGC 


O U u U 




TATCAAATAA GGTATGTCTT TTATGTGCTT CAAAATAGTC CAGTCATATG TTTGACCGGT 3060

ACCACCATAC GACACTGAGG GTGTGTCGAT AATAAATAAA TCTACGAACC CTTTATATTT 3120

ATTTATGTTT TGGATTATGT TTTCATCTnG CAGtAAAGCT TTAGTGATTT TAATGCTTGA 3180

ATATkTCTTT TTAATTTCCT GTAtAAAAtC AATAGATTCt GtGTAACTGT ATTGTGkTAA 3240

wTGaCGwATg CtTAAwACgT GTGCCAATGG T 3271



(2) INFORMATION FOR SEQ ID NO: 437: 
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(A) LENGTH: 1553 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437: 



ATAATGAAAT AATACTGTGT TTTATCTGCG AAATGTAtCA TTTTCTAATc GTTTCACAGT 60

AAAATGAAAA GATAAAGTGT GTTTTTACTT GAATTTTGAC TAAAATTACT CTATATTTAT 120

TAATTGAGCT ATGCTTATTA TTACAATTTG ATTACAAATT TTAAATTTGT TAATTGAATG 180

ATAATATTAA ATAAAGAAAC TTACACAAGC AAATATGAGT TGTAGCCCAA AATACTTGTT 240

AAATCAAAGT TGAAAGCTAC AAATAATGAA AATTATAAAC TTGAATCTGA AAGTAATTAC 300

TATAATTATG ACAATGTTAA CTTTTAAACG CACTTATTAA TTAACTACAT AATGTTAATA 360

TCTAATTTAT TCAAGTACTT TCGCAAGATT TATTATCTAA ATAACGGGGG AAAGAATCAT 420

GAGTtCACAA AAAAgAAAAT TAGTCTTTTT GCGTTCTTCT TATTAACCGT AATAACGATT 480

ACCTTGAAGA CGTATTTTTC TTATTATGTT GATTTTTCTT TAGGTGTTAA AGGTTTAGTA 540

CAAAACTTAA TATTATTGAT GAATCCTTAT AGTTTAGTAG CACTGGTTTT AAGTGTGTTC 600

CTATTCTTTA AAGGCAAAAA AGCATTTTGG TTCATGTTCA TAGGCGGCTT CTTATTGACG 660

TTCCTATTAT ATGCCAATGT TGTGTACTTT AGATTCTTCT CTGATTTTTT AACGTTTAGT 720

ACTTTAAACC AAGTAGGTAA CGTAGAATCT ATGGGTGGTG CGGTTAGTGC ATCATTCAAA 780

TGGTATGACT TTGTTTATTT CATTGATACG TTAGTTTACT TATTCATTTT AATATTTAAA 840

ACAAAATGGT TAGACACAAA AGCATTTAGT AAGAAATTTG TTCCTGTCGT AATGGCGGCT 900

TCAGTAGCAT TATTCTTCTT AAACTTAGCT TTTGCTGAAA CTGACAGACC AGAATTATTA 960

ACACGTACAT TTGACCATAA ATATTTAGTG AAATATTTAG GACCTTATAA CTTTACAGTA 1020

TACGaTGGTG TTAAAACTAT CGAAAATAAT CAACAAAAAg CGCTAgCATC TGAAGATGAC 1080

TTAACaAAAG TATTAAAtTA TACGAAAcAA CGTCaAACmG AGCCTAACCC rGAwTATTAT 1140

GGGGTGGcAA rGAAGAAAAA TATTATTArG ATTCATTTAG AAAGTTTCCA AACCTTCTTA 1200

ATTAATAAAA AGGTTAATGG TAAAGAAgTA ACACCGTTTT TAAACAAATT ATCAAGTGGG 1260

AAAGAGCAAT TCACATACTT CCCTAACTTT TtCCATCAAA CAGGTCAAGG TAAAACATCT 1320

GACTCTGAAT TTACAATGGA TAACAGTTTA TACGGTTTAC CGCAAGgTTC TGCCTTTTCA 1380

TtaaaaGGAG ATAATACGTA TCAGTCATTA CCAGCAATTT TAGATCAAAA GCAAGGCTAC 1440

AAATCTGATG TCATGCACGG TGACTATAAA ACATTCTGGA ACAGAGACCA AGTATATAAA 1500

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(2) INFORMATION FOR SEQ ID NO: 438: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1419 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438:





ACAAAATCAA TCAGCAAAAT GATTGGATTA AGACGCAAAT TGAGCGTTCA ATGGAAGGCG 60

AAACAGTTGG CATTAATGAT CAAAATATAG AAATATATAG TGAAcGtTgA GATTTATACC 120

ATACACTCGT ACCTTTAAAT CAAGAATTGC ATAAGTTGCG ACTTAAAACT CAAAACTTAA 180

CCAATGAAAA TTATAATATT AATGATGTGA AAGTTAAAAA GATTATTGAA GATGAACGTC 240

AAAGACTAGC ACGAGAACTT CACGATTCTG TTAGTCAGCA ACTTTTTGCG GCmAGTATGA 300

TGCTATCTGC TATCAAAGAA ACGAAtTAGA ACCACCATTA GACCAACAAA TTCCTATTTT 360

AGAGAAAATG GTTCAAGATT CGCAGTTAGA AATGCGTGCT TTGCTGTTAC ATTTAAGACC 420

GCTTGGTTTA AAAGACAAAT CTTTAGGTGA GGGTATTAAA GATTTAGTTA TTGATTTACA 480

AAAAAAAGTG CCAATGAAAG TTGTGCATGA AATACAAGAT TTTAAAGTGC CTAAAGGTAT 540

TGAAGATCAT TTGTTCAGAA TTACACAGGA AGCAATTTCG AATACATTGC GTCATTCAAA 600

CGGTACAAAA GTGACAGTAG AATTGTTTAA TAAAGACGAT TATTTATTGT TGAGAATTCA 660

AGATAATGGT AAAGGTTTTA ATGTTGATGA AAAATTAGAA CAAAGTTATG GACTTAAAAA 720

TATGCGTGAA AGAGCTTTGG AAATTGGTGC AACGTTCCAT ATTGTATCAT TGCCAGATTC 780

AGGTACACGT ATCGAGGTGA AAGCACCTTT AAATAAGGAG GATTCGTATG ACGATTAAAG 840

TATTGTTTGT GGATGATCAT GAAATGGTAC GTATAGGAAT TTCAAGTTAT CTATCAACGC 900

AAAGTGATAT TGAAGTAGTT GGTGAAGGCG CTTCTGGTAA AGAAGCAATT GCCAAAGCCC 960

ATGAGTTGAA GCCAGATTTA ATTTTAATGG ATTTACTTAT GGATGACATG GATGGTGTAG 1020

AAGCGACGAC TCAGATTAAA AAAGATTTAC CGCAAATTAA AGTATTAATG TTAACTAGTT 1080

TTATTGAAGA TAAAGAGGTA TATCGTGCAT TAGATGCAGG TGTCGATAGT TACATTTTAA 1140

AAACAACAAG TGCAAAAGAT ATCGCCGATG CAGTTCGTAA AacTTCTAGA GGAGAATCTG 1200

TTTTTGAACC GGAAGTTTTA GTGAAAATGC GTAACCGTAT GAAAAAGCGC GCAGAGTTAT 1260

ATGAAATGCT TACAGAACGA GAAATGGAAA TATTATTATT GATTGCGAAA GGTTACTCAA 1320

ATCAAGAAAT TGCTAGTGCA TCGCATATTA CTATTAAAAC GGTTAAGACA CATGTGAGTA 1380

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(2) INFORMATION FOR SEQ ID NO: 43 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 608 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439:





ATGTnGGCGG ACGTAAAGGA CATGTTTATA CTGATOATCG AGCATTAGAT ATTGATATCG 60

TTCCGCCTGC TCAAGCAGAT GGTAAAGCTA CTAACCCCGA ACAATTATTT GCAGCAGGTT 120

ATGCATCTTG CTTCAACGGT GCTTTCGACC TAATTTTAAA GCAAAACAAA GTGCGTGATG 180

CTCATCCAGA AGTAACACTA ACAGTGAGAC TAGAAGATGA TTCAGACTCA GAAAGTCCTA 240

AATTAAGTGT TTCAATTGAT GCGACAATTA AAAATGTTAT ATCTCAAGAA GAAGCTGAAA 300

AATATTTACA AATGGCTCAT GAATTTTGTC CATATTCAAA AGCGACTCAA GGAAATATTA 360

ATGTCGATTT AAATGTAAAT GTTGTAGATT AGCATTAACT TAAAGAGATT ATTCAACGTT 420

ATTAATAAAA TTCACATAAA ATTCAAATTt StCrAcCAAA AATTTTTGGT TGGyTATTTT 480

TTCTATTCGT GATTGAAATT TCTGGCAATA TTTAACTGAA AATGATTGTA CCTTAGTCAT 540

CATAAATGTG ACCGGTTCCA ATACTGGCTT GACTTCTTCG CATACCGTCT ACAAATAAAA 600

GTCCAGTG 608



(2) INFORMATION FOR SEQ ID NO: 440:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 682 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440:

CCTTTCAAtT TCTCCCAAgA TTTACGCATG TCTGACCAAA TGTCAATACA ATACCTGAAT 60

AATGATTTCT GGAGTAATTG CGAAAAATAT TTAAAAGTTG CTATAGATCA ATTTTCAAAT 120

TATAGTATCT CATCTCAAGT TTCTAACTAT CATTTCACAG TATTACTTGG AGATCGCCAA 180

AAACCACTTA TGTATCTAAA TAAAAATCGC GGTGGTGATG GTGGCATACC AGGTTATATT 240

ATGATTTATT TAGTGCCGAG TACAAGTACA ATTAATTCTA TGAAAAGCTT AATTGCACAT 300 
GAAGTAAATC ATAATATGCG CTATCAATAT ATTGATTGGG ATGGCGGAAG TTTGATTGAA 360 
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ATTGGACOGT GGGTAACAAA TACTAATTGA AGTCGTGATA ACGTAAAAAT TAAAAATACT 480 

ATTTATAATC ATTTACATTT AAAGCATATA TTTGAATCGA tGCCTTATCT CTATGGTGAT 540 

GATATTAATA AACTTCAAGG TAGGCCTATC GTTGGCTTAT CTCATGCTGC CGGGTATGCA 600 

TGTGGCTATC ACTTGGTAAA ATACTTTTTA CAAAAAACAA ACATACCTAT TGAAGTTGCT 660

ACAACACTTC CAGCACAAAA AA 682 



(2) INFORMATION FOR SEQ ID NO: 441:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2574 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441:



ATCTTCTTTT AACATGACAA ATTGCAACAA AATATTAATG ACGCCAAAAG ACATTTTTTC 60

ACGTTCAATT AATTCTTCAA CCATTGTCTT TTGCGATATA GTTGGTTCTG ATTCAGACCA 120

AGAAGCTAAC ATATCAATTG GACTCGTTTG TTCAAGTAAC TCAAACCATT CATCACTTTG 180

TGGCTTTGGA TTCACTTCTG AAGATTTGCC CGCCGAAGAT GATGTAGCAG GAGATTTCAC 240

CTGTAATTTA GGCATTTGAT TTTCGTGTTC CATTAAGTAA TACGAGCGTG CTTGTTTACG 300

CATTTCTTCA AAGGATAACT GTTGTCCACT TGTAATTGAA TTTAAAATAA CATGCTTCAT 360

GCCATCTGCT GTTAAACCAT ATAAAGTCGC GAGTTGTGTA ATTAAACGCT TTGCATCTTT 420

GGTAACAATG TCTTGACTAA TAAAATGTTT ACCTAACATT TGTCTCAACA TTTCAAAGTC 480

AAAAGATTCA TTTGATAAAT CGATACCTTG GTACGGTTCA TTAATCGGaA TATCACTTGT 540

ATCGATATCT ATTTTTGTAG ACGGCACTTT AAAAACATCA GTAAATTGTC TTGTTACCTG 600

TTTAAATTCA CTCAAATCAA TTTGTTGATA CTCAAAGTAT TTCTTCAACT CATGAAATCG 660

ACGATGCTCG ACTTCACTAT ATAAAAAGAT TGACAACATT GGATCATTAA AAAATAAATG 720

TGCTGAAGGG GtTGaATTAA TTGGTAAACA AATTGTGTTT CTTGTTCATC ATGTTTGACA 780

AACGCCTTTA ACAATCCAAT CGCTTCAAGT AAGTCCATTT GTTGTCTAAA CTCTAGTAAA 840

TTAATTTTAA GTTCATTCAT AAAAATATAA TGAGAAAGAA TCAATGTTTC ATTATGACTT 900

TCTTTAACGA ATTGAGTCAT AAAATGATAT AAACCCACTG CTTGCGTTCC AATTAGCGGT 960

GTATACAGTC GATTCAATAC CTCTAAATGA TTCGTATTTA AATCAAAGTG TTGCATAACT 1020

TTGAATTGAT CCTTTGGTCT TAAGCCGAAT TCGAAGGCTT GTCGTCCCaT TTAAGCnATC 1080

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AAATTCTTTA 


TAGACTGATG CAAATCTAAC ATATGAAACT TGATCAACAT GCATTAACAA 1200

GTTCATAACG TGTTCACCTA TATCTCGTGA AGAGACTTCC GTATGACCTT CATCTCGTAA 1260

TTGCCATTCA ACCTTGTTAG TTATGTCTTC AAGTTGTTGA TATCTAACTG GTCGTTTCTC 1320

ACAAGAACGC ACAAGTCCAT TAAGTATCTT TTCTCTTGAA AACTGCTCTC TTGTGCCATC 1380

TTTTTTCACA ACTATAAGCT GACTAACTTC GATATGTTCA AATGTAGTGA AACGTGTTCC 1440

ACAATTTTCA CATTCTCTTC GTCTTCGAAT GGCATTTAAT TCATCGGCAT GCCTTGAATC 1500

TACAACTTTA GATTGTGTAG AATTACATTT CGGGCATTTC ATTACATCAC CCTCTTTATT 1560

TTGATTATGC CTAATTATAC TATAAATCTA GAGATGAAAA AAGAATCCCT CAATTTAATT 1620

CATTTAACCA AATAATGAAA CAATAAAAAA CATTATATCG TTACTTATTA AGTAATTTGC 1680

ATGACAATAT TATTGTATTA AAAATAAAAA ACCTAACTCC GAAGTCAGAG TTAGGCTATA 1740

AATTAATTGT ATTAACTTGC ACTTACAGTT TCTTTTGATG TCAAAAGTGC TCCAATTTGC 1800

TCAGCAACAT CTACAACTCT ATTTGAATAA CCCCATTCAT TATCATACCA AGCAATAACT 1860

TTTACTTTAT TCCCTGACAT GACCATTGTT GATTTTGCAT CAATAATAGC TGAATTTGGA 1920

TTAGTATTAA AATCAACAGA CACTAGTGGT TGATGTTCGA CTTCTATGAT ACCTTCTAAA 1980

CCTGCATTTT CAAAAGCTTG GTTTACTTCT TCTGCAGTTA CTTCTTTTTC TAAATCAACA 2040

ACTAAATCAA CGAGCGATAC ATTCTTTGTT GGTACACGTA ATGCCATGCC GTGTAATTTA 2100

CCTTCTAATT CTGGTAATAC TTCTTTTAAA GCTTTCGCCG CACCAGTAGA AGTAGGAATA 2160

ATGCTTTCAT TACATGAACG TGCACGTCTT AAATCTTTAT GTGGATTATC AATATTTTTT 2220

TGGTCATTTG TAATAGCGTG AACAGTAGTC ATTAAACCAT TAACTATTCC AAACTGATTA 2280

TTTAAAACTT TTGCAACTGG ACCAATGCAA TTAGTAGTAC ATGAAGCATT ACTAAAAATG 2340

TCAAATGCTT CTATATCTAA TTGGTTATCA TTTACGCCTT TAACTACCAT TTGAACATGT 2400

CCACCTTTTG nAGGACCAGT TAACAAAAsT TTTtTGGCAC CTGCTTTAAT ATGTGCGATG 2460

GCTTTATCAC CATGaTTAAA TTTACcAGTT GCATCTATAG CAATATCGAT ATCTAATTCT 2520

TTCCATGGCA AGTTTTCAGG ATTGCGATCA GCAACCAATT TAATTTTATG ATCT 2574



(2) INFORMATION FOR SEQ ID NO: 442:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3326 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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CAAGGTACCG 


GTTTAGGTTT GTTCATTTGT AAAATGATTA TCGAAGAGCA TGGTGGTTCC 60

ATAGATGTTA AAAGCGAATT AGGCAAAGGC ACAACATTTA TTATTAAACT ACCAAAACCA 120

GAATAAAATT GAATATAGTT ATTTCAGAAC GCATGTTATT GATTAGAGAC TCTAATTTAT 180

AGCATGCGTT TTTTGATTGA TGTGGGGAAT TTTGTATGTG GATTAGAACT TAGGGTTTTT 240

GCGAATATCA ACTATTAAAT ATATTACTAA TTTATACAAA AATATAAAGT TTGATAAAGT 300

TATTTATTTG ATTATAAAAA TAGGGTAAAA TATAGATATA TTGTATTAAT TAAATTATTC 360

GAGGTGTCAT ATGAAAAAAT TCATTGGATC AGTTTTAGCT ACGACATTAA TTTTAGGGGG 420

ATGTTCCACG ATGGAAAATG AATCAAAAAA AGACACAAAA ACAGAAACAA AATCTGTACC 480

AGAAGAAATG GAAGCTTCAA AATATGTAGG CCAAGGCTTC CAACCGCCTG CAGAAAAAAA 540


TGCGATTGAA 


TTTGCGAAGA 


AG CATCGTAA 


AGAATTTGAA 


AAAGTAGGTG 


AA£TAA*nY"*TT* 


600 


TAAAGATAAC 


TTTGGACTAA 


AAGTTAAAOC 


TACAAATGTT 


GTAGGTAAAG 


ATGATGGTGT 


oou 


AGAAGTTTAT 


GTGCATTGTG 


AAGATCATGG 




AATGCAAGTC 




lots 


CAAAGATGCC 


ATCCATCAAA 


AAGGATCAAT 


GCGCAGTAAT 


GJVf AA*Pfirt*IYy 


ATYSATATflAft 


/ o yj 


TATGATGCTG 


GGTACAGTGC 


TGAGTGGCTT 


TGAATATCQA 


GOGCAAAAAG 


AAAA12TATY3A 


fid f\ 


TAACTTATAT 


AAATTCTTCA 


AAGAAAATGA 


AAAGAAATAT 


CAATATACAG 






AGAGGCAATT 


AACAAGACAC 


AAAATGTOGG 


ATAT AAAAAT 


■ C2 AAT ATTTTT 

\«W^X«^X X A X X 


ATATTAOATA 




CTCTTCTAGA 


AGTTTAAAAG 


AATATCGAAA 


GTATTATGAA 


CCACTGAXTC 


GAAAAAATGA 




TAAAGAATTT AAAGAAGGAA TGGAACGAGC AAGAAAAGAA GTGAATTACG CTGCAAATAC 1080

AGATGCTGTT GCTACACTTT TTTCTACTAA GAAAAACTTT ACTAAAGACA ATACAGTAGA 1140

TGATGTAATC GAACTAAGTG ATAAATTATA TAATTTAAAA AATAAACCAG ATAAATCTAC 1200

AATCACAATA CAAATAGGGA AACCCACTAT TAATACTAAG AAAGCCTTTT ATGATGATAA 1260

TCGTCCAATA GAATATGGGG TGCACAGTAA AGATGAATAA AATTAATGAT AGGGATTTAA 1320

CAGAATTAAG TAGTTACTGG GTTTATCAAA ATATTGATAT AAAAAAAGAA TTTAAAGTTA 1380

ATGGAAAAAG GTTTAAACAA GTAGACAGTT ATAATGATGA TAAGAATAGT AATTTGAATG 1440

GTGCTGCTGA TATTAAAATA TATGAGTTAT TAGATGATAA AAGTAAACCA ACTGGTCAAC 1500

AGACAATAAT TTATCAAGGA ACATCTAATG AGGCAATTAA TCCAAATAAT CCATTAAAAT 1560

CATCGGGGTT TGGAGATGAT TGGCTCCAAA ATGCTAAATT AATGAATAAT GATAATGAAA 1620

GCACAGATTA TTTAAAGCAA ACAGATCAAT TATCAAATCA ATATAAAATA AAGTTAGAAG 1680

ATGCAGATAG ATTATCAAAT AGTGATTTTT TAAAAAAATA TAGAATGGAA TCAAGTAACT 1740

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ATCAAGGAGC 


GAAACATCCG AATGAAAAAG TTGTTGCTAC TGACTCAGCA ATGATTCCTT 1860

ATOCTGCTTQ GCAGAAATTT GCTAGACCAC GCTTTGaTAA TATGATTAGT TTTAATAGTA 1920

CCAACGATTT ATTAACATGG TTACAAGATC CATTCaTCAA AGATATGCCA GGAAAACGCG 1980

TTAACATTAA TGATGGTGTG CCCAGGTTAG ATACTTTAAT AGACAGCCAT GTAGGTTATA 2040

AAAGGAAGTT AAATAGAAAA GATAACACAT ACGATACTGT ACCACTAATC AAAATAAAGT 2100

CGGTAAAAOA TACAGAAATT AAAAATGGAA AAAAAGTAAA AAAGACTATT AACATAACAT 2160

TAGATATGGA TGGGCGAATT CCAATAAATG TTTGGACAGG AGATTCGATT GCACGTTCTG 2220

GAAGAGGAAC TTTAATTAAA CTTAATTTAG AAAATCTTGA TGCGTTGAGT AAACTGATTA 2280

CTGGTGAAAC AAGTGGTATG TTAGCAGAAT GCGTAATCTT TTTAAATGAA AGTTTTAACA 2340

TCTCAGAAAA TGAAAATAAA AATTTTGCAG ATAGAAAGAa ACAATTATCA GAAGGATTTA 2400

AGGATAAGAT TAACTTATTT CAGTTAGAAG AAATGGAAAG AACTTTAATT AGTAAAATAA 2460

ACTCACTTGA AGAAGTTGCA GATGAAACAA TAGAAAGTAT TAGTGCTGTT AAACACTTAT 2520

TACCTGATTT TGCATTGGAT GCATTAAAAG AAAGAATTAA TGAGTTGTTT AAAGGTATAA 2580

AATCTTTTAT AGAAAAAGTG TATGATAGTA TAGATAATGA AATTTTAGAA ATTTTCAAAA 2640

ATATAGATCA CGACTTCAGA GATGGAGTAT CTGAAGAAAT GATGAAACAT TTGAAAGTAG 2700

TGAAACAGAA TATAGACCAA ATAAAAAATC AAAATGATAT TTATGGTAGG CAAATTGCAG 2760

ATATTAGAAG TATTATGAAA CAACAAGATG CAACAATTTT AGATGGAAAT TTTCAAATTA 2820

ATTGTAGCGG CGAAAATATG GTACAGGGTC TAGTTATACC TTCTAATTAT TTAGGAAGAA 2880

AAATGAAAAT ATTAAAAGAC CATATCGATG ATGGTATTAA AAAAATAGCA GACTATGTTC 2940

AAGGTATATA TGATGAATAT GCATCGAAAA TTGTCGATGT AATAAAATAT TTGATTAATA 3000

CAATTCCCAA AATACGTAAG AATTTAAGAC ATGCAATTGA AATGTTAAAT GTAAAAAAGA 3060

AAGAATTTTT GTCCCTGATT CCTAATGTAA CTTGTAATTA TATTAAAACT AAATTAGAAG 3120

AATTAGATAA TACTTTAGGC AAATGGGAGC CTTTTCTTAA TGATTTAAAA GCAGTGTCAC 3180

CAATTTTAGA TAACCATTTA GATGATATTG TTAAGAACAT GAAGCCTTTG ATTGTACAAA 3240

TGAyATwTGA ACCATCACAT TATGaCGATA TGTTTAATTC aAGAAAAGCT TTAACGcCAG 3300

TGTTCTCAAG CGTTTTATAA AGGTTG 3326


(2) INFORMATION FOR SEQ ID NO: 443: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5301 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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<xi) SEQUENCE DESCRIPTION : SEQ ID NO: 443: 
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ACCTATAGCT 


GGCCCGTCAA GTCCTGAATC ATATGTCGCT ATAAACCCAG TTGCATGCCC 60

AGCAATCTCG GTTTCAATTT CAAAATCATG CTCTTTCAAA CGATCTATTA AAGTTCGAGA 120

CGCAAATATT TCTTCATTAC CAAGTTCAGG ACGTTCATGA ATTCTATGAC TGATTTCGAT 180

ATAACTATAT TTATTTGTTT CTATATAATC GAGAATTTGT TGTTTTTCAC TCATTTTTTA 240

CTATCTCCCT TTTACCCTAC ACTCATTTTA ATCATCCACA ATATTTTGTT CTTTCAAAAT 300

GAATTATTAC TTATTCTATC GGTTTTATCT CATGATGTCA TCTAGTTTTT CTTTATTTAC 360

AAAATTTTCT AATAACTAAA aGCCTTTCAT AAATTTATAA AACAGTTTCA AATTGTAAAA 420

CATAAGCCCT ATTGTTACAA TTTAAGTATT GTTAGAAACA CAACTTACAA CAAACTTAAG 480

TTTTATAATG CAACAAATCA TAAGCGTTTT ATAGTTTTGA GTAGTTAATA GGAGGAAATT 540

CAAAATGACA AAAATGAATG TTGAAAGTTT cAATTTAGAT CaTAcTAAGG TGGTTGCCCC 600

ATTTATTAGA TTAGCGGGAA CGATGGAAGG ATTAAACGGA GATGTCATTC ACAAATACGA 660

CATTCGTTTC AAACAACCAA ACAAAGAACA TATGGATATG CCCGGACTAC ATTCATTAGA 720

ACATTTAATG GCTGAAAATA TTAGAAATCA TAGTGACAAA GTTGTTGATT TAAGTCCTAT 780

GGGTTGCCAA ACTGGTTTCT ATGTATCATT TATTAATCAT GATAATTATG ATGATGTATT 840

AAATATTGTT GAAGCAACTT TAAATGATGT GCTAAATGCT ACTGAAGTGC CTGCTTGTAA 900

TGAAGTACAA TGTGGCTGGG CAGCAAGTCA TTCATTAGAA GGTGCTAAAA CTATCGCTCA 960

AGCATTTCTA GACAAACGAA ACGAATGGCA TGATGTTTTC GGTACAGGAA AATAAATCTT 1020

AGTCAATCAA GTTAATCAGA AAaGCAGTCG AACAATGATT TTACAATCGC CATTGTCCAA 1080

CTGcrrmA TTATGCTTCA AAGTCAAAAA ATCGAACAAA TGAAAAAGTA AAATCTTTAA 1140

CATTTGTCCG ATTTATTTGA GAACCACTAT AATTTCTTAA TTAGTCCCAT TAACACGAAC 1200

TGCATAGGTA ACCTTAAATA TAGTTGCCAT GTTGGCAATT GTTTATCTCC TAAAGGTAAC 1260

TTTTTAACTG CCATATAGAT ATTAGCTGGG AATACAGCTA GCAAGAATAG ATTGATTGTA 1320

TTTTTCAAGC ATTGAGATGG TCTTTTAATT AAAAGTGCAA GTCCAAATAA TATCTCAAAG 1380

ATTCCTGTAA CAAGAACCGC TGTTTTTCGA AGTGGCAAAC ATTTCGGTAT GATATTTCTA 1440

AATTGTCGTT CTCGTGTAAA ATGCAATACA CCTATTACAC TAAAACCTAT TCCTAATAAA 1500

TATCTTAGTA TGTTCAATCA GCTTCAACTC CTATTCTGTA ATGATTTTAT GAATTAATGT 1560

AGGCGATACA ACATGaTCAG CAATTGTTAT GCTTGAATCT AATTTTTTAA CAACATCGTC 1620

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AATTTTTTTA TTTAAAACAA TACCAACCGC TAAATCAATA TCATCCTCTT TTGTTAAACG 174 0 

TCCCGCTCCT AACATCATCG AAGCGACACC TATATCGTTA GAGACTAATT CAGTCACATA 1800 

ACCTGATTTT TTAGCTTTAT ATTCAATTTG ATATTGAGCT TGTGGCAAAC GCTCTGGATG 1860 

GTCAATAACA GTTTCGTCGC CACCTTGGTT TTTAATAAAT GTTTTGAATT TTTCTAATGC 1920 

TGCACCTGAA TTAATTGCCT CAATTAGCAA CGCTCTCGCT TCTTCAAGCG TTTCAGCTTT 1980 

GTTTGCAAGT ACAACCATTT GAGAACCTAA TGTTAATACA AGTTCTGTTA AATCTTTCGG 2040 

ACCTTGTCCT TTCAACGTAT CAATTGCTTC TTGTAACTCA AGCGCATTGC CAATCGCACG 2100 

TCCAAGTGGC TGATTCATAT CAGAAATAAT OGCCATCGTA TTACGTCCCA CATTATTACC 2160 

AATACGTACC ATTGCGTGCG CTAATGCTTC AGCATCTTCT AATGTTTTCA TAAATGCACC 2220 

GCTACCAGTT TTTACATCTA ATACAATTGC ATCTGCACCA GCAGCAATCT TTTTACTCAT 2280 

AATTGAAGAG GCAATTAATC GTATTGAATT GACAGTACCA GTAACATCCC TTAAGGCATA 2340

TAATTTTTTG TCTGCAGGAG TTAAATTTCC TGATTGTCCT ACAACTGCCA CTTTATTTTC 2400

ATTAACCAAT TTCACAAATG TTGCTTCATC TATTTCAACA TGAAAACCAT CAATTGCTTC 2460

TAATTTATCA ATCGTACCAC CTGTATGACC TAATCCACGC CCACTCATTT TTGCAACAGG 2520

AACATCTACA GCTGCTACTA ATGGTGCTAA AACCAATGTA GTTGTATCTC CTACACCACC 2580 

TGTTGAGTGC TTATCTACTT TGACACCTTT AATATCACTC AAATCTATCA TATCACCAGA 2640 

ATTAACCATA GCCATCGTTA ATGCTGCACG CTCATCATCA TTCATATCTT GGAAATAAAT 2700 

CGCCATTGCT AAACTTGATG CTTGGTAATC AGGAATATCC CCTTTAACAT AGCCGCCAAT 2760

AAAGAAATTA ATTTCTTCCG TTGTTAGTGT ATGACCGTCA CGCTTTTTCT CAATAATGTC 2820 

TATCATTCTC ATTTTTATCA TCCTTTTCTT AAAAAGCTTA GGACAAAGCA TCTGCGCTTT 2880 

CTCTAGTCCA TTTTTAAAAG CACAAGCGAA AATTATTATA GCAAGCTATC GATTTCATTT 2940

TTAATATCAC AATTTCAATG CGATGTTATT ATTCTTAAAT AGATTGGTTA TAACGTTAAA 3000

GTCCCTATTA AATTATCTTA GAATCATCAT GGCATTTATG ATGTCTTAAA GCTGATATCG 3060

ACATACTTAT ATATGGTTAC GATGTCCCAT GCTTACATAT TTTTATAAAA TTAGTAATCT 3120 

GAATCTGCTT CTAAACCTTG CATAATTTGa ACGCCTGCGC TCGCACCAAT ACGTGTCGCA 3180 

CCTGCTTCAA CCATTTTATT GAAATCTTCT AAATTACGTA CGCCACCTGA TGCTTTTACT 3240 

TCTATATCAG CACCTACTGT ATCTTTCATT AATTTAACGT CTTCTGCAGT CGCACCGCCA 3300 

CCTGCAAAAC CTGTTGAAGT TTTAACGAAG TCCGCACCAG CCGCTTTTGT TAATTCACTC 3360 

GCTTTTACAA TTTCGTCATG GTCCAACAAT ACCGTCTCAA TAATCACTTT TACTGTGTGA 3420 
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TTTAATGCGC CGATGTTGAT GACCATGTCA ATTTCATCTG CACCATTTTG AATCGCATCT 3540 

TCTGTTTCAA ATGCTTTCGT CGCAGTTGTC GATGCACCTA ATGGGAATCC TATTACCGTA 3600 

CAAACTAACA CCTCTGAATC AGCTAGTCGC TCTGCTGCAT ATTTAACATG CGTTGGATTC 3660 

ACACATACAG ATTTAAAATG GTATGCTTTC GCTTCATCGA TGATTTGATC GATTTGCGTA 3720

CGTGTTGACT CAGGCTTCAA TAAAGTGTGA TCAATCAATT TTGCACTATT CATTTTCTAT 3780

CTCCTCCTTT ATGGTTGATT ATAAAAATAC GGTTGTAAAT TAGTTGATTG AGCGTCAGGT 3840

TCATTTAAAT ATCAGGTTAG ATGTTCGCTT TTTATGTAAC CGCATACATA TACTATTACA 3900 

TTAATTCATT TCCCATAAAC AAACAATACA ATTGAACGTG ATATCTTCAT TATGAACGAT 3960 

GACTTGACAA CAAGCTAATC AGGATTATAT TTTTATAATT CTTTAATTCT ATAGTACAAA 4020 

AATTCGCAAA AAAGGGAAAC AAATGTTATC TTAAAATTAT TAATGAATAT TAAGGAGAAG 4080 

ATAACAAATG ACAAAAGGTA CACCACATAT TCAACCAAAT GGAGTAAAAA TTGCTAAAAC 4140

AGTATTAATG CCTGGCGATC CGCTACGTGC AAAATATATT GCTGATAATT TTTTAGAAAA 4200

TGTTGAACAA TTTAACGATG TACGTAACAT GTTTGGTTAC ACTGGTACAT ATAAAGGTAA 4260

AGAAGTTTCT GTAATGGGTT CTGGTATGGG TATTCCAAGT ATTGGTATTT ACTCATATGA 4320

GTTATACAAC TTCTTTGATG TAGATACAAT CATTCGTATC GGTTCTTGTG GCGCATTACA 4380

AGAAAATGTT AACTTATACG ATGTTATTAT TOCACAAGCT GCATCAACTA ATTCAAATTA 4440

TGTAGATCAA TACAATATTC CAGGTCATTT CGCGCCTATC GCTGACTTCG AGTTAGTAAC 4500

TAAAGCTAAA AATGTCGCTG ACCAAATCGG TGCTACTACA CACGTAGGTA ACGTATTATC 4560

TTCTGATACA TTTTACAATG CCGATCCAAC ATTCAATGAT GCTTGGAAAA AAATGGGTAT 4620

TTTAGGTATC GAAATGGAAT CAGCTGGTTT ATATTTAAAT GCGATTCATG CTGGTAAAAA 4680 

AGCACTTGGT ATTTTCACAG TAAGTGATCA TATTTTACGT GACGAAGCTA CTACACCTGA 4740 

AGAACGTCAA AATTCATTTA CACAAATGAT GGAAATCGCT TTAGAAATOG CAGAGTAACT 4800

TATTTAAATT GACTTTAATT GCTCTTTAAC AATGCGATTA AACTCAAAAA GCCAACACAT 4860 

TCTGGGCGTA TCCCCATTTA TGTGTTGGCT TTTATTTATA TTATTACTTA TCTGTAGATT 4920 

AGCTTAAGTA AGATTTAAAC ATCCAATTAT GTTTATCTAC TGATGTTTGC ATACCTATAA 4980

ACATATCTTC TGATACATCA TCGCCAGCAT TACCAGCAAT TTCGATTGCG TTTTCTAATT 5040

GTTTTGAGAT ATTTGTGAAG TCTTGTGATA ATTCTTCAAC CATTTGTTCT GCAGAGTAAC 5100 

CTTTCGCAGC TTCTTTAACA ATTGATTGCT CTAAGCATTC AGTTAATGTA CCTACAGGGT 5160 

TTCCTCCTAC CGCTAAAATT CTTTCAGCTA ATTCGTCTAC ATATTGGCTT GCTTCATTAT 5220 
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AATTOTGTAG CTTTGTGTAA G 53 OX 

(2) INFORMATION FOR SEQ ID NO: 444: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11466 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444:

GAAGATGAAA GCTATATTGA AGATGATGTT ACTAAAAAGG AAGCTATTTT AAGTATGCAA 60

ATTCCTAAAG GTTTCTCTCA AAAATTAAAA GAGAACCGTT TAAAAGAAAC GATACAGTTA 120

TATGGTAGAG ATGACTTTAT AGGTGGTATT GCTGTAGAAA TTGTTAGTAG TTCATTATAT 180

GAGCAGCAAA TTCCTAACAT TATTTATGAA CACCTTGAGG ATATGAAACA GCATCAATCC 240

ATCGATGCTA TCAACAAGTC CTATCATAAA CATACACCTG AATCTAAAAT CAAATTTGTG 300

TCGCTTACTA AACAAGCACA ACACTCTATT TCAATTAGCT TAATCTTTGC GGTGATTTTA 360

TTTGTTAGCG CTGTTCAAGT AGTACTTCAT TATCGTTTAA ACCAACAAGC AGCATTGCAA 420

CGATTATCAC AATATCATTT AAGCCGTTTC AAACTATATA GTACTTATGT AATGACACAT 480

ACGATTTTGT TATTGTTGGT ACTATTGGCA GTTAGTCTAT ATTTGTCTCA ACCACTCAGC 540

TTAATATTTT ACTTAAAATC ACTGTTACTT ATATTGATTT ATGAGATAGG TATCGTCTTT 600

ATCTTATTCC ATATTCAAAC AATAAGTCAT CGATTATTCA TGACATTTAT ATATGCACTT 660

GCTATGGGTA TCGTATACTT GATTATTTTC ATGTAAAGGA GCGTAACTGA TGATAGAAAT 720

TAATAACCTT TCAAAGCGTT ACCGTAACAA ACAGATTTTC AATCATTTAA CTATGTCCTT 780

TGATAGTAAT CGTTTAACCG TATTACTTGG TGATAATGGT GCTGGAAAAT CAACATTACT 840

TCGTATGATT GCTGGTATTG AAAAAGCTAA TGATGGAACT ATCAACTATT TCGGCGAAAA 900

ATGGAATCAA AGACaAATAC aAAATCACAT CGGTTATGTG CCACAAGACA TTGCGTTATT 960

TGAACACATG ACAGTGGCTG AAAACATTAA ATTTTTTAAA TCACTTTGTA AAAATCCAAT 1020

TAACGATACA ACTATCAACG AATATTTACA GCAATTAAAC TTTGATGATA CGTCTGCCAA 1080

AGTATCTACA TTGTCCGGTG GGAATAAACG TAAAATTAAT ATATTAGTAG GTTTACTAGG 1140

TCAACCTCGA ATTCTCATTT TAGATGAACC GACAGTTGGT ATTGATTTAA AATCTAGACA 1200

TGACATCCAC CAACTACTTA ACATCATGAA ATCTAAATGT TTAATTATAT TAACTACCCA 1260

TCATTTAGAT GAAGTTGAAG CACTTGCAGA TGATATCAAG TTAATTGGCC AAGATCCTTT 1320
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CCCAAGCTGC GTATOATATC GCAACTTGGG ATTTTCTGTA TTATCTACTT TGCAAOTATG 1440 

ACGTTGGGTC TACTGCATAT TGATTACCGA TGCCACCAGA CATACGTTGG AAGTGTACGT 1500 

GAGGCGCTGT TGAATTACCC GTACTACCTG AATATGCAAT TTGGTCACCA GCTTTGACTT 1560

TATCACCAGC TGAAACAGTT AAACGATTAT TATGCATATA CCATTGGTAG TTATTACTGT 1620 

TCGCTTCTTT AATCGTTACT TGATTGCCGC CACCATAGTT ACTCCAACCT GCTTGTACTA 1680

CTGTACCATC AGTTAATGAG TAAACTGGTG AATTTTCAGG CATTGCATAG TCGACACCGT 1740

AATGOGCACC ACCACCGTGA TATTGTCCAT ATGGTTGTAG TIGTTTACGA CTTGTTAACC 1800 

AGCTTGCGTC TTTCGCATGA CCACTAGcTG TCGCTTTACT TGCTGATCCA CCATTTTGAT 1860

TAGATGTGCC ATTAGGATAA TTGACCTTTC CATTACCATC ATGGCTGTTA TACGCTTGGT 1920 

TGTTGTTACT ATGTGAATAA TAGCTCGCGT CTGGACCTAC ATTTGATTGA TAACCATATT 1980 

GATTAATATG CTGTTGGCTT TGACTCGCTG TGTAGTCATT GTTATCTCCT GCTGTTGCTG 2040

GATTCACATA TGTTTGGCCG CTTCCATTGG CATTTGCATT TTTTGGATAA CAGTTATAAA 2100 

AATAATGCGT ATGTCCTTGA GCATCTACGA ATGTATAGCT ATATTCTTTA TTATCAAACA 2160 

TTGCTTGATT CCAGTTACCA TCAGGTGTGT GATGATAATC CCCATTAGAA TCAATTGTAT 2220

AATAAGTACC ATAAGATACG TCTTGTGATT GTGTTGACAT TTGTGTATGT GCTTGTTGGG 2280 

TGTTTGTCGT TTCTGCTGCA TCTGCTTGAT GCGCCATTGT AAATGTAGCG AAsCCATCGT 2340 

TGCAATCGCT GCTGCTGTTA ATTTTTTCAT GTATAAAACA TCCTCCATTA AAGTTAAAGT 2400 

TAGTTTTCAA TTAAACTGTA CTGCACATAC TAAAAGAATT AGACAACTGA GTAAAGGATT 2460 

TAATTCTCAT TTTCCAACTA TTTAATATTC CCGAAATGTT TTACTAAACT CATTACATTG 2520

TCATTACAAA ATAGCCATAC ATTGATATTA AAATGACATC TCTCACTGCA TTCGTTTAAC 2580 

CTTTTATAAA TTTTCAAAAT TAACAACTAA TCGTTCGTCC ATGTTCGCGA TTCAACGCTA 2640 

ATGCATGATA GTAATCATCC ATCAAATCAT ATCAACCAAA TTCCATTATC AATCGCTATT 2700

GATTGTCATT CAACTTTCTA ATAGTGATAT GCTTCTCAGG CTTAAAAATC GTCATATCTT 2760 

TTCTATTAAT TAAATCATCT GTGAGCTTTA ATGCTACTAA TTCATTGCTG CCATAATACT 2820 

TAATATATAA CGTTCTTGTA GTTAAATTTA TTACGGTCTG ATACATCGTA TAGTGATTGT 2880 

CAGCATCATG CGGACGTACA ATTCCAATCG GTATATTTAC CGCATCTAAT AAATAAAATG 2940 

CATTCATTAA ATCCATTTCT TTATCATTGT TTTGAGCAAT GTTTGCTTTC ATAAATGCCA 3000 

TTCTCACAAA GCGCTCAGTT GAAGTAAATC CACCTGGCAA TCCAAATGTA CCTGCTTCAT 3060 

TGCCTAAAGG TTCAATCGTT ACACCTTCCA ATAAATTTGC TGTTGCTGGA TAAGGAGAAA 3120 
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5 



CACCAATAGG ATTATCTTTT ATAACCACTT CACCCTCTTT AAATGAAACT TCGACTGTAT 3240

GTCCAGTTGC ATCGGAAACA TGATAATGCA ATGGCGGAAC TTCACCGATG TCATTTAAAT 3300

ATACAGCTAC AACATGTATT TGGGATGCTT GTTGTTTCAT ATCTTCAATG CTTGTTGTAT 3360

ATCCCAAAAT CCATGTCACA ATTTCATTTT GCGTAATATT CATCGCGTCC GCTTTGTGTG 3420

TTGATCCATA TGAACTATAA CCTCGGAAAT ATTGTGTTGA AATGGCAACG CcATGTtCAT 3480

TAACACCATC ACCATAAATA AAACCTTCCA TATCTGTTCC TGTGCCAATA AAGCCATATT 3540

GCGTTTGGCC TGTCGTGCCA GTGCAAGATT TCCAACGATA ATTTCTAGGC GTCACTGCTG 3600

GCGAACCATC TAATGOATAA TCATAATCCA TCGTGCGTCC AAGAAGTACT TGATTATTTA 3660

AAGTTTGTAT TGTGAATCCT GTGCACATTG TTCTCACTCC TCTGTACCTT CATTTACTTT 3720

AATCACTTTC AAATAAAGCT GTTTCACTTA AACATACTAT AAAAAATCAA TTATACAAGC 3780

AATTAATTGA TATTCATTCT CAATAACTGT GGTATGATAT GTAAGGAAAT CATGACTTAT 3840

GTGTGAGTGA ACGATCATCT ATACATCCGT TCACTTCATC TCATGACTTT CTATATTTAA 3900

TTTTTACAAG GAGTGACATC TGTGAATAAC ACACAATCTT CACCACGCAG TAATATTATT 3960

ATTGCGATTA TGTTGTCTGC ATTAACATAT TGGTTGTTTG CACAATCATT TATTAATATA 4020

GGACCTCTCG TTGGTCAAAC ATATCAAACC TCTCCTGCCG TGTTAAATTT ATCTATTAGT 4080

TTAACTTCCT TCGCCACAGG TATCTTCATG GTGGCTGCAG GTGATATTGC TGATAAAATA 4140

GGACAACTGA GAATGACATA CATGGGTCTC ATAATCAGTA TGTTTGCATC TCTTCTATTA 4200

ATTATATCGG ACATCACTGC ACTGCTCATC ATCGGTAGAA TTTTACAAGG TCTATCAGCA 4260

GCTATCTTGT TACCTTCAAC AGTTGGCGTG TTAAATAATC AATTTAAAGG AGAACATTTA 4320

AGACGAGCGA TTAGTTATCT AATGATTAGT ACTGTTGGTG GCATCGGCCT AGCTGGTGTT 4380

ATCGGCGGTT TAATTGCCTC AAATTTCGGA TGGCAAACGA ATTTCATCAT TAGTATAGTC 4440

ATTGCTTTCA TTGCCATATT GCTTCTAAAA GGGACACGTG AAAAAGTAAG TCAACATAGC 4500

CACCGTCATC CATTCGATTA CAAAGGTATG TCGATTTTCG CTGTTATGAT TGGTAGCTTT 4560

ACATTATTGT TAACACAAGG ATTCGAACAA GGTTGGTTTA GTACATTTTC AATCATTTGT 4620

CTGAGCATTT TTATCaTCAc TACGTTGATA TTCATCATCA TCGAACGTCG ACATGAAGTA 4680

CCTTTTATTG ATTTCTCAGT ATTACGCAAC CGTCCGTTCA TTGGTGCATT TTTAAATAAC 4740

TTTGTTTTAA ATAGCGGTCT AGGCGTAACA GTGGTCTTTT TCATATATGC TCAAACACAC 4800

CTTGGTTTAt CAgcTGCGCA ATCTGGACTT GTTACATTGC CATATGCCAT TGTGGCAGTT 4860

GCGATGATTC GTTTAGGTGA AAAAGCAACA TTACGTTTCG GTGGCAAATT GATGCTCATC 4920
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TCACAATATG 


TCATTGCAGT 


TATCATTGGT 


TTCGTCATAT GTGCGATAGG 


TAATGGTTTA 


5040 




GTCGCAACAC 


CTGGACTTAC 


GATTGCAATT 


TTCAGTATGC 


CTAATGAAAA 


AGTTGGTTTA 


5100 


s 


GCTACAGGAT 


TATATAAAAT 


GAGTGGTACA 


TTAGGTGGCT 


CCTTTGGTAT 


AGCACTAAGT 


5160 




ACTACAGTTT 


TCAGTATGTT 


ACAACTAAAC 


TATGCACCAA GTGTAGCTGC AACCGTAACA 


A 

5220 


10 


TTTATAGTCA 


GCATTGTATT 


W» X S2AX V_>V_> X X 


GGCTCATTGT 


CTGCATACAT 


GATCATTCCA 


5280 


AAAACAGTTA 


AATCTTAAAT 




AATTATGTTT 


CGAAATATCT 


TTATCACTTT 


5340 




AAAATGATAT 




f*A Afi & A 2k 1* AT 


AAGCGAACTG 


AATAAATAAA GATTCAATTA 


5400 


IS 


ACGCATCAGT 


**X X*VO\JLrt X X v» 


AY "IV TAAAAr 


GATTAATAGT 


TTTATAAGAA 


GGTGTTTGAT 


5460 




v< x xw\uv X4v\ 


vv^nutiLAi x 


Lunn 1AC X\9 X 


CATGAAGTAA 


TTTCTCTATT 


CTACGACCAG 


5520 






TTYSAfSTATAT 
A luMulAlnl 


fH'* 14 T fk T fc * ?* 
uUA 1A X AALiA 


TGATTTTTAA 


CATGATCTTT 


GGATGATAGG 


5580 


20 


A TYSTTYS fY2r« f 


AWViAlvaA x\a I 


.CTGAATTCAT 


CGAATTTGCT 


ATCAGGTATC 


GTTTCAACAA 


5640 




111 f\X 




AT AT CATTTT 


GAGGAATTCT 


AACAGAAGTT 


TTTATTGGTA 


5700 






{jOvUAAAGTGT 


LTlAlTllTl 


TAAAGTATTT 


CAAAGTAAAA 


TTACATGTTA 


5760 


25 


TaTTa S"V""T* ft /""T> TV 

AIALulAvflA 


X X AATvSv* (JG A 


GACTCCTGAG 


GGAGCAGTGC 


CAGTCGAAGA 


CCGAGGCTGA 


5820 






*P A i'"*/"* TV TV TV 


7i "Vk ^1 ^^^^ Wk. mm/"iK 

AAGCCATTCA 


ATACGAAGTA 


TTGTATAAAT AGAGAAGAGC * 


5880 


30 




TTTCTAATTG 


AAAATTATCT 


TACTGCTGTT. TTTTAGGGAT 


TTATGTCCCA 


5940 


vj^-L- 1 v_ X 1AL 1 


V.1AAI lAlAl 


TCACTATCAA 


TTAGACAAAA 


TGGCCATTTT 


CAAATATCAC 


6000 




V3U\J X X\* lliL 


lwatAt-t-l IvxAA 


TATATTTATT 


ATAATTCTCT TTTTGAAAAT 


CAGTTAACAT 


6060 


35 


TAATTTAniVT 


fZT AffYST ATT 


TTAACACTTT 


TTGCATTGTT 


TCTATTCTCA 


TTTTTCTAAA 


6120 






TPTTTT1 & 


CAATACGATT 


AACAGCATCA 


TATGATAATT 


CTACTGTTTC 


6180 




TTPAATTTVA 




ATGAAATAAT 


CGTGCACATT 


AAAAACGTAT 


CACCAAAGTA 


6240 


40 


ATAAACATCT 


AAATCATCAC* 

****** A V*^fc X V^lVi 


GTTTATGTTG TCCAACTAAC AAACGACGAT ATTCGAACTC 


6300 




TTTTTCriT?GA 


TATTTCAATT 


CTAAAAAACT 


AATAATCTCT 


TCTTCTTTTA 


ATTTGAATTG 


6360 




CATTTAAAAA CATCCTCTCT TAAGTTTTAA CAAGCCTTAA TTAAAAATTT TTTCAATCAC 6420

ATAGTTCAAT ATACATCATT TCGTTATGTT TTTTAATACT TTGTTCAAAA ACAAATATTT 6480

TATTCTTTAA AATAATGACT TTTGTATTTT TAATCACAAT AAACATTTTA AAATTCTTGT 6540

TATCATAATC ATTAAAAGGT ATTAACCTTA ATAATATATT CTCTCGTCTC AACCTTAATC 6600

GTATACTTCA GACGTCTGTT TGTAGACAAT AAAAGTCATT CACGTCTTCA TATGTCATCA 6660

AATGTTTATC ATGATATGAT GAATATAATA ATCGGGTATA TAACTGTATG ATTAATTACA 6720
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TGGTGTATCA GTTACAACTO TGTCACATAT TTTAAATCAT AATGATAGTC GTTTTTCCGC €84 0 

AACAACGATA AAAAACGTAC ATGCTGTTTC AGAACGTTTA GGCTATGCCC CTAATAAACA 6900 

TGCAAAACAA TTGCGCGGCA GTAAAATTCA AACTATTGGC GTCATTTTGC CTAGCTTAAC 6960 

AAATCCGTTT TTCTCAGCAC TGATGCAAAG TATTCATGAC CATAAACCAT CTGATGTTGA 7020 

TTTATGCTTT TTAACATCTA CAGCAACTGA TTTGTATGAC AATATTAAAC ATTTAATTGA 7080 

TCGAGGTATT GACGGATTAA TTATCGCACA ATACATATCA TCCCCGGACG CCCTAAATAA 7140 

CTATCTAAAG AAACATCATG TACCTTATGT CGTACTGGAT CAAAATGACC ATCAAGGCTA 7200 

TACAGATTTT GTTCGGACAA ATGAATATCA AGGTGGACAA CTTGCAGCAC AACATTTAAT 7260 

AGAACTCGGT CACAACCATA TGATAATTGT TGCACCATAT GACATGATGG CGAATATGTC 7320 

GACTCGTGTC GCTGGATTTG TCGATACTXT GCGCGCGAAT CAATTGCCAG AACCACAAAT 7380 

CGTCCATACT GAATTATCTA AGCGCGGTGG GCTAACCATT GTTGATGACA TCATGGTTCA 7440

ATCTGCCACT GCAATCTTCG CTATTAACGA TGAACTCGCT ATTGGCATTT TACGAGGACT 7500 

AATTGAACAT GGCATCAGTA TCCCGAAAGA TATCTCATTA ATAGGTTATG ACGACATTGA 7560 

TTATGCAGCG TACGTCTCGC CACCTTTAAC TACTGTGGCA CAACCTATAA CTGATATTGG 7620 

CAAAACATCT TTAACCTTAT TACTTCAACG ATTACAGCAC TTAGATAAAT CCATTGATAT 7680

GATTGAATTA CCAACGACTT TAAAAATTCG TGCAACAACT GGCTATCATC TTTCAAACTA 7740 

ACTACGTATC TTCCGAAATA TACTCATCAT TGTTAGGCCC TTAGCGTTGC TTTAATGCTG 7800

AGGGTTTTTA ATCATAATTA TTTTACTAAG AAATTAAAAT AATAATGTAT GAATTTTTAA 7860 

ATATGATTTA AACGTTTTCA GTTTTTATGA AAACGCATGC ATTTTACAAA TAAAAATGGT 7920 

ACGATGGCAC TGGTAAAACG TTTTACTAAA AACAAATCAT GAGGTGTATA ACATGAGCAT 7980 

TGTTGCATTA CTTATCGGGT TAGGCCCCTT AATTGGCTGG GGCTTCTTCC CAACAGTCGC 8040 

TTCAAAGTTT GGTGGTAAAC CTGTACATCA AATTATCGGT GCTACTGTAG GTACGTTAAT 8100 

CTTCGCTATT TTATTAGCCG TAGTCACATC AAGTGGCTTC CCTACTGGAA CCAATTTGCT 8160 

ATTCGCCTTA TTATCAGGTG CAGGATGGGG ATTCGGACAA ATCATTACAT TTAAAGCGTT 8220 

CGAATTAGTC GGCTCATCTC GTGCCATGCC AGTCACAACA GCATTCCAAT TATTAGGCGC 8280 

ATCTTTATCG GGTGTCTTTG CATTAGGAAA TTGGCCAGGC ATTGGTCATA AAATCATTGG 8340 

ATTTACAGCT TTAGTCGTTA TTCTAATTGG AGCGCGTATG ACAGTTTGGA GTGAACGCAA 8400 

AGAAGCAAGT AACGCCAAAA ATTTACGTCG TGCAGTGGTA CTTCTGTTAA TTGGTGAATT 8460 

TGGATACTGG TTATATTCAG CTGCACCGCA AGCAACTTCT ATTGATGGCC TAACTGCCTT 8520 

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AGCAGAGAAT CCATTCCGTA ATAAAATTAC GTGGTTACAA ATTATTTCAG GTTTCTTCTT 8640 

TGCATTTGOT GCTTTAACAT ATCTTATTTC AGCACAACCT AATATGAATG GTTTAGCAAC 8700 

TGGATTTATT CTTTCTCAAA CATCCGTTGT GCTTGCTACA TTAACTGGTA TTTATTTCTT 8760 

AAAACAACAT AAAACGTCAA AAGAAATGGT TATTACAATC ATCGOCTTAG TACTCATTTT 8820 

AGTAGCCGCT TCTGTTACAO TATTTATAAA ATAAGGAGTG TAGATGTCAT GAAAAAATCA 8880 

GCTGTTTTAA ATGAACATAT TTCAAAAGCA ATCGCGACAA TTGGTCATTT TGATTTATTA 8940 

ACGATTAATG ACGCTGGCAT GCCAATTCCA AATGATCATC GTCGTATCGA CCTAGCTGTA 9000 

ACTAAAAACT TACCACGCTT TATTGATGTC TTAGCTACAG TGTTAGAAGA AATGGAAATC 9060

CAAAAAATAT ACTTAGCAGA AGAAATAAAA GAACATAACC CTACACAATT GCAACAAATT 9120 

AAACAATTGA TTTCATCGGA AATCGAAATC ATTTTCATTC CTCACGAAGA AATGAAAAGT 9180

AACTTAGCTC ACCCATTAAA TAAAGGTAAT ATTCGTACTG GTGAAaCAAC GCCCTACTCT 9240

AATATTGCAT TAGAATCGAA TGTTACTTTT TAAAAGTTAT AACTTGAAAG GAGCGTACAC 9300 

ATGACCAACA AAGTTGTTAT TTTAGGTTCA ACGAATGTCG ATCAATTTTT AACAGTTGAA 9360 

AGATATGCAC AACCAGGCGA AACATTACAT GTTGAAGAAG CACAAAAAGC ATTCGGCGGA 9420 

GGTAAAGGTG CCAACCAGGC TATTGCCACT GCACGCATGC AAGGAGACAC AACATTTATT 9480 

ACTAAAATTG GCACTGATGG CGTTGCTGAT TTCATCTTAG AAGATTTTAA AGTAGCTCAT 9540 

ATTGATACAT CATATATTAT CAAAACAGCT GAAGCAAAAA CGGGCCAAGC CTTTATCACT 9600 

GTGAATGCAG AAGGACAAAA CACCATCTAT GTTTATGGTG GTGCGAATAT GACGATGACA 9660 

CCTGAAGATG TTATTAACGC AAAAGACGCT ATAATCAATG CAGACTTTGT CGtTGCACAA 9720

TTAGAAGTAC CCATCCCGGC TATTATATCT GCATTTGAAA TTGCCAAGGC ACATGGTGTG 9780 

ACGACAGTAT TAAATCCTGC ACCAGCGAAA GCATTACCTA ATGAATTATT ATCATTAATC 9840 

GATATTATTG TGCCAAACGA AACAGAAGCC GAATTGTTAT CTGGGATTAA AGTAACTAAT 9900

GAACAATCTA TGAAAGACAA TGCCAATTAC TTTTTATCTA TAGGCATTAA GACTGTTTTG 9960 

ATTACGCTAG GTAAGCAAGG TACATATTTT GCTACTAAAA ATCAAAGCCA ACACATCGAA 10020 

GCTTATAAAG TAAATGCGAT TGATACAACT GCTGCAGGCG ACACATTTAT TGGTGCATTT 10080 

GTCAGTCGCT TAAACAAGTC GCAAGATAAC TTAGCAGATG CTATTGATTT TGGTAATAAA 10140 

GCGAGCTCAC TCACTGTACA AAAACACGGC GCGCAAGCAT CTATTCCTCT ACTAGAAGAA 10200 

GTAAATCAAG TTTAAATGAA TCAAACACAG CTATGATATG AAGGTTTAGC ATATAACATG 10260 

CAACATTCGT ATATCATGGC TGTGCTTTTT TATCTTTATA AAACATCATC TATTAGAAAT 10320 
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TTTGTAATCT TTTTAACTTC CAAATTaTCG CATATAAATA TGCTATATTA ATGATAATAA 10440 

TTATCAATTA AAAGGAGGTT ATGCTATGTC TAAAGAAGCT GGTCATACAT TTTTAGCTAA 10500 

ATTAGOAAAA ACTCGTCTAC GCCCCGGTGO TAAAGAAGCA ACAGATTGGT TAATACAACA 10560 

AGGGGCATTT TCACAAGATA AACAAGTGTT AGAAGTGGCA TGTAATATGT GCACAACATC 10620 

TATTTATCTA GCTCATACAT ATGGCTGTCA CATTCAAGGC GTTGATATAA ATAAGAAAGC 10680 

ATTAGAAAAA GCACAGGAAA ACATTTCAGC AGCAGGTCTT GAATCATATA TTCAAGTTCA 10740 

ACAAGCGAAT GCTGTTAAAT TGCCCTTTGA TGACAATCAA TTCGATATCG TTTTAAATGA 10800 

AGCAATGTTA ACAATGTTAC CCATCGCCAT AAAGGAAAAA GCATTACGCG AGTACTACCG 10860 

AGTCTTAAAG CCTGGGGGTA TCTTGTTAAC ACATGATATT GTCATCGTTA ATGAATCACA 10920 

TGCCACACAT GTTGTTAAAT CATTATCTGC AGCAATTAAT GTCAATGTCT CACCGCAGAC 10980 

GAAACTTGGC TGGTTAGATT TATATAATCA AGCTGGTTTT AATCATGTGC ATTATCATAC 11040 

TGGTCCAATG AGTTTAATGA CACCAAAAGG TTTAATTTAT GACGAAGGTA TTGTTGGAAC 11100 

TATAAAGATT ATCAACAATG CTTTGAAAAA AGAAAATCGA CCAATGTTTT GTAAAATGTT 11160 

TAAAACGATG ACTAAATTGC GTAAAGATAT GAATTATATT ACTTTTGTCG CTAAAAAAGA 11220 

GCACTAAATA TAATGCCACT AACTGTACTT TGTATCTATG TTTGACTATC ACTTTAATTT 11280

CTTTGTGACA CTAATCATCT ACTTAACAAT ATCGTTATCG TTGATTAGTA AGTCATCAAT 11340 

TTTGGTTAAA GACTTTCATA AACACTCAAA CATTAACACT ATACATAGTT AGTGGcATTA 11400

TTTTTTyCTn AAAATTTTAA CmTCmCGGGr TtGGGAmCrG AAaTGrtAwT TcGCrmAAtT 11460

TAwTcT 11466 
(2) INFORMATION FOR SEQ ID NO: 445: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2176 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445:

TTACATAGTT AACACTAGTT AATCTATTAG TTAACATTAG TTAATAATTA GTTAATTTCC 60 

ATTTGTATTC TCATGTGATA AATTCTAAAA GCATACaATA AATTTAATAT GTAAAAAGAA 120 

AGGGAATACA CATGAAAAAT AAATATATCT CGAAGTTGCT AGTTGGGGCA GCAACAATTA 180 

CGTTAGCTAC AATGATTTCA AATGGGGAAG CAAAAGCGAG TGAAAACACG CAACAAACTT 240 
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AAGTATTACA TCTAAAAGGT ATCACAGAAG AACAACGTAA CCAATACATC AAAACATTAC 360 

GCGAACACCC AGAACGTGCA CAAGAAGTAT TCTCTGAATC ACTTAAAGAC AGCAAGAnCC 420 

CAGACCGACG TGTTGCACAA CAAAACGCTT TTTACAATGT TCTTAAAAAT GATAACTTAA 480 

CTGAACAAGA AAAAAATAAT TACATTGCAC AAATTAAAGA AAACCCTGAT AGaAGCCAAC 540 

AAGTTTGGGT AGAATCAGTA CAATCTTCTA AAGCTAAAGA ACGTCAAAAT ATTGAAAATG 600 

CGGATAAAGC AATTAAAGAT TTCCAAGATA ACAAAGCACC ACACGATAAA TCAGCAGCAT 660 

ATGAAGCTAA CTCAAAATTA CnTAAAGATT TACGTGATAA AAACAACCGC TTTGTAGAAA 720 

AAGTTTCAAT TGAAAArGCA ATCGTTCGTC ATGATGAGOG TGTGAAATCA GCAAATGATG 780 

CAATCTCAAA ATTAAATGAA AAAGATTCAA TTGAAAACAG ACGTTTAGCA CAACGTGAAG 840 

TTAACAAAGC ACCTATGGAT GTAAAAGAGC ATTTACAGAA ACAATTAGAC GCATTAGTTG 900 

CTCAAAAAGA TGCTGAAAAG AAAGTGGCGC CAAAAGTTGA GGCTCCTCAA ATTCAATCAC 960

CACAAATTGA AAAACCTAAA GTAGAATCAC CAAAAGTTGA AGTCCCTCAA ATTCAATCAC 1020 

CAAAAGTTGA GGTTCCTCAA TCTAAATTAT TAGGTTACTA CCAATCATTA AAAGATTCAT 1080 

TTAACTATGG TTACAAGTAT TTAACAGATA CTTATAAAAG CTATAAAGAa AAATATGATA 1140 

CAGCAAAGTA CTACTATAAT ACGTACTATA AATACCAAGG TGCGATTGAT CAAACAGTAT 1200 

TAACAGTACT AGGTAGTGGT TCTAAATCTT ACATCCAACC ATTGAAAGTT GATGATAAAA 1260 

ACGGCTACTT AGCTAAATCA TATGCACAAG TAAGAAACTA TGTAACTGAG TCAATCAATA 1320 

CTGGTAAAGT ATTATATACT TTCTACCAAA ACCCAACATT AGTAAAAACA GCTATTAAAG 1380 

CTCAAGAAAC TGCATCATCA ATCAAAAATA CATTAAGTAA TTTATTATCA TTCTGGAAAT 1440 

AATCAATCAA AAATATCTTC TCTAGTTTTA CATCATTTTT TAAATAATTT TCGTAACAAA 1500

CCGTGATTAA AAAGAACCGT TGATTCTCAA TCGAATCTAC GGTTCTTTTT TCATTTTCCA 1560 

TCAATTAAAT GCTTCTTCGC TATTTGTCAG CCCACTTTTT TACCTGCAAC TTGTTAAATA 1620

ATCCTTACAT CGTTAACGAA TAGTTCATCA TTTAGTTGAA TCAGCTCAAC TTTATTAACT 1680 

TCATATTTTC ACAAACTATT GCGCAATCCA TTCCTTTTCC ACTACAAGCA CCATAATTAA 1740 

ACAACAATTC AATAAAATAA GACTTGCAAA GCATAGTTAT GTAGCTATAT AAACGCCTGC 1800

GACCAATAAA TCTTTTAAAC ATAACATAAT GCAAAAACAT CATTTAACAA TGCTAAAAAT 1860 

GTCTCTTCAA TACATGTTGA TAGTAATTAA CTTTTAACGA ACAGTTAATT CGAAAACGCT 1920 

TACAAATGGA TTATTATATA TATGAACTTA AAATTAAATA GAAAGAAAGT GATTTCTATG 1980 

ATTAAAAAtA AAATATTAAC AGCAACTTTA GCAGTTGGTT TAATAGCCCC TTTAGCCAAT 2040 

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CmTyCAAArG AcACaGACAT TACTAGCCAA CGATTTAGCT ATnACTCCAA ACCTTCCATT 2160 
GGATTTGGTA AAGGnT 2176 
(2) INFORMATION FOR SEQ ID NO: 446:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1557 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446:





AAAAGCATGG CTTAAATGAA GTACGCTATA ACAAATTACA AGAACATGCT ATTGTTATGC 60

ATCCGGCACC TGTGAATAGA GGAGTAGAAA TACAAAGOGA TTTAGTAGAA GCTTCAAAAT 120

CAAGAATTTT TAAGCAAATG GAAAATGGCG TTTACTTAAG AATGGCAGTC ATTGATGAAT 180

TATTAAAATA GGTAAGGGGA CGAAAATGAT GAAATTAATT AAAAACGGTA AAGTATTACA 240

AAATGGCGAA TTACAACAAG CAGATATTTT AATTGATGGT AAGGTAATTA AACAAATTGC 300

ACCTGCAATT GAACCAAGCA ATGGTGTTGA CATCATAGAT GCGAAAGGTC ACTTTGTGTC 360

ACCTGGATTT GTCGATGTTC ATGTTCATTT ACGTGAACCT GGTGGTGAAT ATAAAGAGAC 420

AATTGAAACT GGTACTAAAG CTGCTGCTAG AGGCGGATTT ACAACTGTAT GTCCAATGCC 480

TAACACAAGA CCGGTACCAG ATTCTGTAGA ACATTTTGAA GCTTTACAAA AATTAATCGA 540

TGACAATGCT CAAGTACGTG TATTACCTTA TGCTTCAATT ACAACACGTC AATTAGGTAA 600

AGAATTGGTT GATTTCCCAG CACTAGTAAA AGAAGGTGCC TTTGCGTTTA CAGATGACGG 660

TGTAGGAGTA CAAACTGCAA GCATGATGTA TGAAGGCATG ATTGAAGCTG CAAAAGTAAA 720

CAAAGCCATC GTAGCACACT GTGAAGATAA TTCATTAATC TATGGTGGTG CAATGCATGA 780

AGGGAAACGC AGTAAAGAGT TAGGTATACC AGGTATTCCA AACATTTGTG AATCTGTTCA 840

AATCGCAAGA GATGTACTAT TaGCTGAAGC AGCAGGTTGT CATTATCATG TATGTCATGT 900

TTCTACTAAA GAAAGTGTTA GAGTCATTCG TGACGCTAAA CGCGCAGGCA TTCATGTTAC 960

AGCTGAAGTT ACACCACACC ATTTATTGTT AACAGAAGAT GATATTCCTG GTAATAATGC 1020

CATTTATAAA ATGAATCCAC CATTGAGAAG TACTGAAGAT AGAGAGGCTT TGTTAGAAGG 1080

GTTACTAGAC GGTACAATTG ACTGTATCGC AACAGACCAT GCACCACATG CACGTGATGA 1140

AAAAGCACAA CCAATGGAAA AAGCaCCATT CGGAATTGTT GGTAGTGAAA CAGCATTCCC 1200

ATTATTATAT ACGCATTTTG TAAAAAATGG TGATTGGACA TTACAACAAT TAGTAGATTA 1260
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TTATGCAGAT TTAACAATCA TTGATTTAGA TAGTGAACAA GAAATTAAAG GAGAAGATTT 1380 

CTTATCAAAA GCAGATAATA CACCATTTAT CGGCTATAAA GTTTATGGAA ATCCGATCTT 1440 

AACAATGGTT GAAGQCGAAG TTAAATTTGa GGGGGATAAa TArTATGCAA gCAAACGTTA 1500 

TCTAGTGTTA GAAGACGGTC TTTTACGAOG CTACCGTTAG GTCTGATAAC TTACTGA 1557 



(2) INFORMATION FOR SEQ ID NO: 447:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1799 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447:


GCTAGAAATm TTGmATGaCA ATACAACTCT GTTAAAaTGA TGGACGTAGA CAAATATGCG 60

TATTGACGCT TTATTTTAAA AATTaACATG CTTATAACAT GTTTATAGAA GGAGATTAAC 120

CTATGAACTA TCaAGTTCTT TTATATTATA AATATATGAC GATTGATGAC CtGAACAGTT 180

TGCTCAGGAT CACTTAGCCT TTTGTAAAGC ACACCATTTA AAAGGTAGAA TTCTTGTTTC 240

TACAGAAGGT ATTAACGGCA CATTATCTGG TACAAAAGAA GAAACCGAAC AATATATGGC 300

ACATATGCAT GCCGATGAAC GATTCAAAGA TATGGTGTTT AAAATTGATG AAGCTGAAGG 360

ACATGCTTTT AAGAAAATGC ATGTACGTCC TCGAAAAGAA ATCGTTGCTT TAGATTTAGA 420

AGATGACGTC GATCCAAGAC ACACAACTGG CCAATATTTA TCACCTGTAG AATTTAGAAA 480

AGCTCTTGAA GATGATGACA CAGTCATTAT TGATGCACGT AATGATTATG AATTTGATTT 540

AGGTCATTTC CGAGGTGCAA TTCGTCCAAA TATCACACGT TTTAGAGATT TGCCTGACTG 600

GATTAAAGAG AATAAAGCGT TATTTGCAGA TAAAAAAGTG GTTACGTACT GTACTGGTGG 660

CATTCGATGC GAAAAATTTT CTGGATGGCT TTTAAAAGAA GGTTTCGAAG ATGTAGCTCA 720

ACTTCATGGC GGTATTGCTA CATATGGTAA AGATCCTGAA ACAAAAGGTG AATATTGGGA 780

CGGTAAAATG TACGTATTTG ATGACOGTAT CAGTGTTGAT ATCAACCAAG TTGAAAAnAC 840

AATTATTGGT AAGGATTGGT TTGATGGCAA ACCATGTGAA CGTTATATTA ATTGCGCTAA 900

CCCAGAATGT AATAAACAAA TATTAGTTTC TGAAGAAAAC GAAACTAAAT ATTTAGGTGC 960

ATGCTCTTAT GAATGTGCTA AACATGAGCG TAATCGTTAT GTTCAAGCAA ATAATATTAG 1020

TGATAATGAG TGGCAACAAC GTTTAACAAA CTTTGATGAT TTACATCAAC ATGCTTAGTT 1080

TTAATTAAAT ACCTTTCAAA ACACGCTTTG AAAATCCGAT TTATAAAGGT TTTTCAAGGC 1140
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5 



TAAATTTTAA TACTGCGGGG TGTCTTAAAA TGCACATTTT AGTAACAGGG TTTGCGCCTT 1260

TTGACAATCA AAATATCAAT CCCTCATGGG AAGCTGTGAC TCAACTAGAA GATATTATTG 1320

GCACACATAC AATCQATAAA TTAAAACTAC CAACCTCTTT TAAGAAAGTA GATAATATTA 1380

TAAATAAAAC GTTGGCATCT AATCATTATG ATGTTGTACT AGCTATAGGA CAAGCTGGTG 1440

GTAGAAATGC CATTACCCCA GAACGTGTCG CCATTAATAT TGATGATGCA CGTATTCCAG 1500

ATAATGATGA TTTTCAACCT ATTGATCAAG CCATTCACTT AGACGGTGCG CCAGCTTATT 1560

TTTCAAATTT ACCaGTTAAA GCAATGACTC AAAGTATTAT TAATCAAGGA CTTCCTGGAG 1620

CACTTTCAAA TAGCGCAGGT ACATTTGTTT GTAATCACAC ACTTTATcAC TTAGGTTATT 1680

TACAAGATAA GCATTACCCT CACCTACGAT TCGGATTTAT TCaTGTGCCA TACATACCAG 1740

AGCAGGTcAT TGGTAAACCC GATACACCAT CTCATGnCCA TTGAGGAAAA GATnAGTTG 1799



(2) INFORMATION FOR SEQ ID NO: 448:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1341 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448:



ACTTGGTTTT TTATTGTTTA TAAATAAAAC TCACTTAATA ATGTTTTCAT AATCTTCTTC 60

GACTACTTAA TTCTTTAAGA TATTCGTGAA AAGAGACATT ACACTAGTTA ATTTTCAAAC 120

AATACAAAAA GCGTCTACCT CCTACATATA ATTGTAGCGG AGATAGACGC TTAATATTTA 180

TTTAAAAATT ATTTTAAACC ACCGAATGTC ATAACATCAC GGGCAATCAT ACTTTCTTCA 240

TCTGTTGGAA TAACGACAAC TTTAACTGGT GAATGAGGAT AGTTAATAAA TCCTTCTTTA 300

CCACGTAGTA AGTTTTCATT TTTCTTAGGA TCCCAGTAAA CACCCATAAA TTCTAAGCCT 360

TCAAGAACTT TCGCACGAAT TTCTACTGAG TTTTCACCGA TACCTGCTGT AAATACGATA 420

ACATCAACAC CATGCATTCT CGCAGCATAT GATCCAATAT ATTTGTGAAT TTTAGAAGCA 480

AATACATCTA AAGCCATTTG TGAACGTGCT TTACCTGATT CAGCTTCTTC TGATAAGTCA 540

CGTAAATCAC TAGATGTACC TGATAATCCT AATAAACCTG ATTCTTTGTT TAAGATTTCC 600

AATACTTGTT CAGCAGTTTT ACCTGTTTTT TCCATAATAA ATGGAATTAA AGCAGGGTCA 660

ATATTACCAG AACGAGTACC CATTGTTACA CCAGCAAGTG GTGTGAAGCC aTTGATGTAT 720

CAATAGATTT ACCGCCATCG ATAGCTGCAA TTGATGCTCC ATTACCAATG TGACATGAAA 780
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TATGGCTTGT ACCATGGAAA CCATACTTAC GAATGCCATA ATCTTTATAA TAATGATATO 900 

GCAAGCTATA TAGATATGCT TTTTCAGGCA TTGTTTGATG GAATGCTGTA TCAAAAATTG 960 

CCACATGAGG GATATTTGGT AATAATTTAC GGAAAGCACG AATACCCAtC AAGTTaGCTG 1020

GGTTGTGaAG CGGT6CTAAT TcGCTTAATT CTTCAATTTC CTTTTCAACC TCATCAGTAA 1080

TAGCTACTGA TTCAGGGAAT TTTTCACCAC CATGTACAAC ACGGTGACCT OTTCCATCGA 1140

TATCGTTAAT ATCATTAATA ATATTGTGCG CTTTAAAAGC ATCCAACATG ATATCAACTG 1200 

CCTCAACGTG ATCCTTGATA TCTTGTACTG TTTTAACTTT TTCCCCGTTG ACTTCAATTG 1260 

TAAAAATTGA ATCCTTCAAT CCGATTCTTT CTACTAAACC TTTTGTTACT AATTCCTCTT 1320

CAGGCATTCT AATTAATTGA A 1341 
(2) INFORMATION FOR SEQ ID NO: 449:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1529 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449:





TTTTGAAGAT ACTACCGATG AAAATAGACA AAAGATTTTT CAATATTTAT CACCTGAAgA 60

AGTTGCAAAT TTCTTTGATC AATTAGATAT TGATGACGAT GAATATGAGT TGCTATTTGA 120

TAAGATGAAT GCGACATACG CAAGTCACAT ATTAGAAGAA ATGTCATACG ACAATGCAGT 180

AGATATTTTA AATGAGTTGA CTAAACCAAA AGTTGCTAGT CTTTTAACAT TGATGAATAA 240

AGATGACGCG AATGAAATCA AAGCATTACT TCACTATGAT GAGGATACGG CCGGCGGTAT 300

TATGACGACG GAgTATTTAT CACTTAAAGC GCATACGCCT GTTAAAGAAG CATTATTATT 360

GGTCAAAGCG CAaGCACCAG ACGCAGAAAC AATATATGTT ATATTTGTCG TTGATGATGA 420

TGGTAAATTA GTAGGTGTTT TATCGCTAAG AGATTTAATT GTAGCTGAAA ATGATGCTTA 480

TATTGAAGAT ATTATGAATG AACGTGTCAT TAGTGTGAAT GTAGCAGACG ACCAAGAAGA 540

TGTTGCTCAA GTTATGAGAG ACTATGATTT CATGGCTGTA CCTGTTATAG ATTACCAAGA 600

ACATTTGCTT GGTATCATCA CGATTGATGA TATTTTAGAC GTTATGGATG AAGAGGCTAG 660

TGAAGACTAC TCTCGTTTAG CCGGGGTATC AGATATCGAT TCGACTAATG ATTCAATCAT 720

TAAAACAGCA TTAAAACGTT TACCATGGTT GATTATTTTA ACATTTTTAG GAATGATTAC 780

TGCGACAATT TTAGGGAGAT TCOAAAAAAC ATTAGAAAAT GTAGCGCTAC TCGCAGCGTT 840
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10 



TCGTAACATT ACGAcAGGGG AAATTAATGA GCAAAGTAAA TTTAGAATTG CATTAAGAGA 960

AGCAGGAAGT GGTGTATTAT CGGGTGTTGT ATGTTCAACA ATATTATTTA CAATTATTGT 1020

TGCAATATAT CATCAGCCAC TTTTAGCATT AATCGTTGCA GGAAGTTTAA CTTGTGCGAT 1080

GACGGTGGGG ACGTTTGTAG GTTCGATGAT TCCATTATTG ATGAATAAAT TAAATATCGA 1140

TCCAGCAGTG GCTAGTGGAC CATTTATTAC AACAATTAAT GATATTATTA GTATGTTGAT 1200

TTATTTTGOT TTAGCTACAT CATTTATGGC TTACTTAATT TAAGGAGGAG TTATGGAGTT 1260

TTTATCTTTA GTTATTGTTG TTTTAGCAGC GTTTTTAACT CCAATAATTG TCAATCGATT 1320

AAATATTAAT TTCTTGCCAG TTGTTGTTGC AGAAATTTTG ATGGGGATTG TGATTGGAAA 1380

TTCATTTCTA AATATAGTAG AAAGGGATTC AATTCTAAAT ATTTTATCAA OGTTAGGCTT 1440

TATCTTTTTA ATGTTTTTAA GTGGTTTAGA AATTGATTTT AAAGCTTTTA AAAAAGATAA 1500

ACGCGCACGT CAAGGACAAA ATGATGATG 1529



(2) INFORMATION FOR SEQ ID NO: 450: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1827 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450:



TTCTGGAAAC CAAAGTATTG TCATCTTCTA CTAGTAGTAT nGGCATCCAT ATCACCCAAT 60

ATCATTTAGT ATATTTCATA TTTTCTCCTG ATTTTAATCG ACTTTGAAAA TCTTTAATCC 120

GGCAGTCAAC TTCAAAGCCA TGAATCATCA •rrrriTGCAT TGGTGCTTGT ATAAAGTAAT 180

AAATCGGCCA AAATAATCGA GGGATATAAT CGTATAGATG TATATAAACG ACTGCCGACT 240

CTTTGATTAA TCTAAATTCT AACTTCCCTT GATTAACTGT ATATTTTri'C ACTAAACTTC 300

CACTCAATAA AATTAAAGTT ATTATTCCAT CAGCTGTTTC TTCTATTTTA AATACTGCGA 360

GCGGTCGCAC CTTATTCTTA ATATATATCT TAAATTGATC ATGTGATTTT TCTGTTTTCA 420

CAAAAGTTCC TTTAGTGTAA CCCATCCATG CAATAAAATG GTTTACAACG TTCTTTAATG 480

TCCATCCCTT TGGTAAAACT ACCTTCATCG TTGATCTAAC ATCATCATAC TTTGAAACTT 540

GTAATTCTAC ATTAACTAAA GAACGTTTAA AAACTAAATT TGTTGTTTCT ACAGGTGTAC 600

CATATGCACC TAGGCGTTCA ATTGTTTCAT TATCATAACG ACTCCCAGGT ATGTAGATTA 660

CTTTTTTTAC TTGATTGATA GCCGCTGCTC GACCAAAATT ATCTGCTGCG ATTAATGTTA 720
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CAGCAATATC 


TATTTGATTC 


ATTGCTGCAA 


CAACCTGTTC 


GTAATGAAAT 


ATATCACACT 


840 




GAATCCAAGT CATTTCAACA TCATCTGTTT kTTTATTGTC TGGATATTTT GATATAGCAA 900

AAAGTTCAGC ATCATTTTCA ATCACTTCAC TTAAATACTT ACCAATATAT CCTGTTCCAC 960

CTGCTAATAA AACTTTAGGT TTCATCTAAA ATACTCCTTT AAACTGTAAC CAAAAAAGAT 1020

ATTACTCCAC CTTTTAGTTA CATATATATT ATAATAGTAG CAAATGTTTT AAAATTTCAA 1080

AATACTGGAG GcrrriTATG GCCCATATTA TACGTAGAGT TAGTATCAAA GATGTAGAAA 1140

ATTTCATTTC AATGTTAGCG AACATATACG ACGAATCTCC GTATATGTTC TACACACCAG 1200

GAGAATATGA TCCTAGCGTC ACATCGGCTA GTAAACAATT AGAAGAATAT ATCACTTCTC 1260

CGCATAAAGT CATCTTCGTT GCTGAAAGTG ATGAACAACT CGTTGGCTTT GCCTTTGTTA 1320

ATACGACACC ATTTCAACGC 7\T""T , T\ II A /"'TV TV*' TTGCTAAAAT TGATTTAGGT GTAAAGAAAT 1380

TATATCAACA TCGTGGAATT GGCCAAGCAC TTCTTGATGC CATTATGGCT TGGTGTTTAA 1440

ACAATCAAAT ACACCGAATT GAAGCAAATG TACCACTCAA TAACCAACCT GCCCTCGAGC 1500

TTTTTAAAAG TGCCGATTTT CAAATCGAAG GCGTTTTAAA AGATAAGTTA TTTATCGATG 1560

GTAAATATTA TGATGACTAT ATGATGGCTA AAATTCTTAA TTAAAGCTAT TTTATCATAA 1620

TCTTGTATCA GAATCGTATA ACAACGAATT TAATGGTTAC CTAATACATT ACTCATACTT 1680

ATCAATGTTA TCTAATCTCA AATAAATACG TACACTCTTA TTCATTTATC AAATTTAAAT 1740

TCAAAATAnA ACACCACTAA TGTGTAATTG ATTAACTATC AACTACGATT AGTGGTGCTT 1800

TATATATGTG GTTAGTTTTC CTnACTA 1827



(2) INFORMATION FOR SEQ ID NO: 451: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 616 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451: 



ATATAGATTA ATGTTTGTTA TTTGTACTGT GTTTGCCAGT GACATTTATT ATAAAACATA 60 

TGACGTTATT CTATGkTGTA CTCATTtTCT ATATTGTAGG TTTATTAACG ATTAGAAGTA 120

TTATTAAAAA GTTGAAATAT CAGGAAACAT TATTACGAGA CTAAAAAACT TCCATTGGCA 180 

TGTATGTTGT AAAGGTGCAT GTAATGTTGA ACGCCAAATG ATACGGCGTT CAGATTACAT 240

TAGCATCTAT ACGTTAACAG CATAACCAAT GGAAGTTTTC TTCGAATCTA TTCTTTTATT 300 
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AATGCGTCTT TTGAAAAATG GTCATTAAAQ OCATCAGATT GCTTAAAOTC TTCGTATOCA 420

TGTCGATCAG CAAATCCGAA ATAAATTTTG TATGTTGTAC CTTTAGCAGG TCTTAACAAA 480

CGATAGCTTT TAAAGCCACC AAAGTTTCTG AAATTATCGT CTACACTAAT CAGTTTCTTT 540

TCAAGTTGAT ATGCATQATC TTCTGTTGAT GGaATGAAgA TTGCACaATA GAAATGaTGT 600

TCACtGAATT CACCAA 616



(2) INFORMATION FOR SEQ ID NO: 452: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 944 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452: 

GCACOAGTGA TTAAACGGTT AATCAATGAA ACATTTGATG CAAATTACAT TGAAGTTATT 60 

GAGGGAGGAA TTGAAGAAAC GCAAACGTTA ATTCACTTAC CTTTTGACTA TGTCTTCTTT 120 

ACAGGAAGTG nAAATGTAGG CAAAATCGTT TATCAAGCTG CCAGCGAAAA TTTAGTTCCT 180 

GTGACATTAG AAATGGGCgG TAAAtCTCCa GTCaTCGtTG ATGAAACAGC GAATATTAAA 240

gTTGCTAGTG AGCGCATTTG TTTTGGGAAA TTCACTAATG CCgGCCaAAC ATGTGTTGCa 300

CCAGATTACA TTTTAGTACA CGAATCTGTA AAAGATGATT TAATCACAGC CCTATCAAAA 360 

ACGTTGCGTG AATTTTATGG TCAAAATATA CAACAAAGTC CAGATTATGG CCGCATTGTA 420

AACCTTAAAC ATTATCATCG TCTGACTTCA TTACTTAACA GTGCACAAAT GAATATTGTA 480

TTTGGTGGTC ATAGTGATGA GGATGAACGT TATATAGAAC CAACATTGTT AGATCACGTT 540 

ACAAGTGATT CAGCAATTAT GCAAGAAGAA ATTTTTGGTC CTATCTTACC GATTTTAACG 600

TATCAGTCAT TGGATGAAGC AATAGCCTTT ATTCACCAAA GACCAAAACC TTTGAGTTTA 660 

TATTTATTTA GCGAAGATGA AAATGCTACA CAACGTGTAA TAAACGAGCT ATCATTTGGC 720 

GGCGGCGCTA TTAATGATAC ATTGATGCAC CTAGCGAATC CTAAATTACC ATTTGGTGGT 780 

GTTGGTGCCT CAGGTATGGG ACGCTATCAT GGTAAATATT CATTCGACAC TTTTACACAT 840

GAAAAAAGCT ACATTTTCAA ATCnACACGA TTAGAATCAG GTGTCCATTT ACCACCATAT 900 

AAAGGTAAAT TTAAATrCAT CAAAGCTTtC tTTAAAnATT AATT 944 

(2) INFORMATION FOR SEQ ID NO: 453: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4820 base pairs 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453:



TCTCCAGTAG ACCTTGTGTA TGAACAGTTT CTTTCATATG AATGAACATC GTTTTTAAAG 60

tttcstttcaa CTCAGCCTTA TCAGGATAAT ATCTAGAGAC AGTCGTCTCT GGCATCTCCA 120

TTGTATGATA TTTAACCTTA TGCAGCGACC CATGATTTTC GTTATAAACA AATGTATGAT 180

TCACTTCATC GAAATCATGA TCTTCTCCTG CAATCCAAAA AACTGGTACT ACTTGTTGCT 240

TATGTGTATC CGTTAATTCC TTAGATAAAG TAATGATTGa AAATATTTTA TGGAATGTAT 300

ACAATGGTCC CCOGAAAAGC CCTGCTTGTT GTCCACCAAT CACAACTTTT GAACCATTAG 360

CTAAATGTTG TATGTTTAAT TCTTGTTCAC TTGAAAGCTT GAAAGCTTTA AATCACTCAT 420

ATATTCACGA ATAACATTCG CTAATGCCGC TTCTCTTCCA TTATTTTCTT TAGACATCCT 480

TTTTTCAAAA CTAGTTTGTT GAGCTGCATC ATATTGAAAT AATCCTGTTA TTACAGGGTC 540

ACTGCTCTTT ATTTTTGGTA TAAACTGATC TTTTTCATTT AAACTAACTA CTTTACAGTC 600

GATurrrrrr CTCCTTAAGT ACGCGATTAC AAACTATAGT ATAAAGTCTA TACCGGTGAT 660

TGACAATTTT ACGGCTTGAA AATCAATTTA ATCATGGAAA ATTTATAATA TTCATTGTTT 720

TACATTTTCA AATCAATGAA AAACACAAGT GGTTTAATGT ATAATAATAG TAGTAAACAA 780

ATAAGGGGTA GATAAATATG AGTGAAATCA AACGTCTTGA AATTAATTAC AAAACTGACG 840

AATTATTCGA AAACTTTAGA GCGTTTGGCA ACAAAGACTT ATACATGGTC AATGAGTTAA 900

ACGGTCAAAT GATTGATGCA AGTTCAGATT CACCATTTTA TGGCATATTT GTCGaGATCA 960

ATTAGGAGCT AGAATGGCAT TACTAAAAAA AGGTGATGTC GAAGAAATCT ACTTCCCAGA 1020

TTTTGAAGAT TATATATTAT TATGGAAGTT AGAAGTATTA CCAAAATATC AAAACAGAGG 1080

GTACGCTTCA GAATTGATTG ATTTTGCAAA GAGTTTCAAT ATGCCAATTA AAGCCATTGG 1140

CAGAAATGAT TCTAAGGATT TCTTTTTACA TCATGGATTT ACAGATGTGG AAGCTAAAAA 1200

TATAGAGGGA CATGATGTCT TATTGTGGAA ACCATAAGAT AATAATATTC GACACTACGA 1260

GCATGAAAAT GCATCTTTTC GTAGTGTCTT TTTTACAATT ACTTTCTTAA GCTAATATAA 1320

GTaAATCATT TTCAAATTAT TTGTCTTAAC GTACAATATC ATTTAGTTGT TTCCATGrAT 1380

TAATTTCATA ATCAGGTATA ATTCCTGGAT TATGATCAAA TCCTCTAAAa TTAAACCAGC 1440

AAGTAGCTAT ACCcGCATTG ATTCCACCTA GAATGTCAGA TGTTAnAGAA TCTCCaACTA 1500

TAATCGAGTG CTGtCTTTCA TCCTCACCAA TATCATTAAA AACATAATTA AAAAATTCCG 1560
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ACGGCGTCTG ATTTAACCTT CTCTTTTGCG TTTOGGTTAC ACCATTAGTA ACAATATATA 1680

AATCATGTCT TTTCGATAAT TCGACAATTG TTTCTAATGT TTGATCAAAG TATTTAACTT 1740

TAGCTTCTGC TAATCCATTT CTAAATAACA CATCTGCACG ATGCCCATCA ACTTCCATTT 1800

GATGATGTTT GAAGTAATTC ACAAATCGTT CTGATAATAC TTCAGACTTC GkTAATTTAT 1860

TTTGkTGAAA AGCTTCCCAA TGTTGGTGaT TGaTTTTTTT AAATGkTAAA AAATCATCCy 1920

TTGTTGCTTT ATGATTAAAA ACATTCGCCA TATAGTGAAn CGCCCATTCT TCTGcATCAT 1980

AAAAATCAAC AATTGTATCA TCAAAGTCTA TCAAAATATT TTTATATCCC AATTTCCCCA 2040

TCTCCTATAT TGTCTATGTA TCTAAATCTT AAGAGAGGCT CAAATTTCTG CAAATAAAAT 2100

AAACTGAGTG CATAACATTA AAGTATGCTC ACCCAGTTTA TTTTAAAGAA TATTAGTTAT 2160

TATATTAGAA TCCAAATAAT TTACCTAGTA AACCCACACC GTTAGCAACG ATGTCTACGA 2220

TACTTGTGCC TAATTTCACA CTATCATGTT GTTGTGCAGC TTGCACAGTA TTTGCGATTG 2280

CTTCTGCTAG TCCAGTCATT TAAATCTCTC CCTCACCTTT GAAATAATAC TGATTACTTA 2340

CATAACATAT TGAAATTAGA ATCCGAATAA TTTACCTAAT AAACCTACGC CATTTTCAAC 2400

GATGCTCACA ATGCTTGTGC CTAATTTTGC GCCATCATTA TTAATTGCTG CAGTTACGGT 2460

ATCTTTAATT GCGTTAAATA AACCTTCCAT TGAAAACACT CCTTAAAATT TAAATTTGAA 2520

GATAACAAAA ACGTGCGTAg yTTTTAAATC ACCGAAATGT TATTCGCTTA ACGTTTTGTT 2580

GTTGTTATTT TAAAATAAAT TTGATGCAAT TAGTTTGTTT ATCCGCACAA CATCTTATAA 2640

TGTACTTAAC TGTATTTTAA AGAGAAAAGA AATACAGTTA GGCATTCAAA ACTGTATTTA 2700

ACACAATTAA GTTGCCTGAA TTCGTATTTA AGTCTTATTG AACCTTTTTA GATAAATAGC 2760

TCTATAATAG TGAAAAATAT AAACATTTTT TATTTACAAG GTATTGCTAA TTTAAGTTCA 2820

TTTAGATATA ATAATTCTTG TGTTGTTAAA CGTGTCCTGG TAGCTCAGCT GGATAGAGCA 2880

ATGGCCTTCT AAGCCATCGG TCGGGGGTTC GAATCCCTCC CAGGACGTTT ATAGGTATTT 2940

TTATACGCAT TACCAAACAA AAGAGTTCCG TGATTACGGG GCTCTTTTTG TTTTGAATTT 3000

CAGTAATATA GTATGATGCG TCACCAAAAC GTCCCCCGCA TAAGCCCCGA AAATACAGTA 3060

ATTAAAACAA GCATGCTTAT TCGTTATAGA ATTTTTTGAC ACACAATTGA CACGCGTCTG 3120

ACACTTGTTT ATACATTTTT AATTAAGTAA TTTTGTGCTC AAATTTCATC TATACTGCAC 3180

CTGAACTACA CCAACACTAC ACCAAGATTT TTAACACTCA CCATTTGCAT GCGTAGAGAT 3240

TTTTATTATT ATATTATTCC TATAGATTTT GATACTATTC AAAATTTTAG GGACTTTTCA 3300

GGGGCCCGAA ATCCTATAAT TATAATTATA TACATCTAAA AAAAATAACC ACGTCCATCG 3360
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ATACTATTGG CAAATTTATA AAGTAGTTCA GCGTTTTTCA ATGACATATT GTCTAATGAT 3480

CTTTCATTTT TTCTCATTCT GTGTATTGTG CTTTGTGGAA CTCCTGTTTG TTTCGATATA 3540 

TGTAAACTGC TCAAATCACT GTCTAATAGT TTTTGAATTT GATTTCTCAT TTTGTACACC 3600 

CCCTTGTAAA TCTTCAATCA TCATTTAAAT TAAATAATTA CTTTTCCACA TATTCCAATA 3660 

TTTAGGTTGC AAAGCATACC TCAAATATCA TTAATTTTGA GATTTAAATG TCAATTATGT 3720 

GTCTATCAAT CCAATATACA TACTCTAATA ACGTAATAGT ACACACTCTT CTTATTAATT 3780 

GGCCATAGCT ATCATGATAT AATTAGTGAA GAAAATCACA TAAGAAAGGT TGTAAATCAT 3840 

GAGACTTCAA AAAGCACCTC TAGTAACGTC AGGACTAGTC TTAGGATTAT TAGGCCTGGG 3900 

TAATCTATTA AAAGACTTAT CTCTTACTTT AAACGCTGTT TGCGGAATCT TTGCTTTCTT 3960 

GATTTGGATT CACCTTTTAT GTACTATGAT CAAATATTTT AATAATGTGA AAGAACAATT 4020 

AAACAGTCCT CTAGTTTCAT CAGTGTTCAC AACATTTTTC ATGTCTGGCT TTTTAGGTAC 4080

TACTTATTTA AATACATTTT TTAGTAACAT AACTTTTATC AATAGCTTAA TAACGCCTAT 4140 

TTGGATTTTA TGCCTTGTGG GAATTATGAC GCATATGATT ATTTTTTCAA TAAAATATTT 4200 

AAAAGATTTT TCACTTGAAA ATGTTTATCC TTCGTGGACT GTACTTTTTA TTGGTATTGn 4260

TATCGCAGGA TTGACGGCAC CCGTTAGCGG ATATTTTTTC ATAGGTCAAT TAACAGTAAT 4320

ATATGGCTTT GTAGCTACTT GTATTGTCTT ACCTATAGTT TTCAAGCGAT TAAAAGCATT 4380

TCCATTGCAG ACGTCAATCA AACCGAACAC ATCGACAATT TGTGCACCAT TTTCTTTAGy 4440

CGCTGCAGCA TATGTTATAG CTTTTCCTAA GGCGAATGCT TTTATCGTAA TTATATTTTT 4500 

ACTATTAGCT CAAATATTTT ATTTTTATAT CATTATACAA TTGCCTAAAT TACTAAAAGA 4560 

ACCTTTTTCG CCCGTATTTT CAGCTTTCAC ATTCCCTTTA GTAATCTCAG CAACTGCTTT 4620 

AAAGAACAGT TTGCCTGTAC TTATGTkTCC AGACATTtGG AAAGGkCTTT TGTTTATCGA 4680 

AGTGTTATTA GCCACTGTAA TAGTACTTAG AGTCTTTATA GGATATCTTC ACTTCTTTTT 4740 

AAAAAAGGAA AAACAAGATA AATTTCTnCG TAATGCGTCT CAGTAACACT ATTACCAAGA 4800 

ATTAACACGT ATATTTAATA 4820
(2) INFORMATION FOR SEQ ID NO: 454: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4358 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear
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ATTTGGATCT TTAATATCAC CAATATTTTT AATATCTTCC GGATTCAATC CATATACTTG 60

TACTGTATCT GAGTATTTAA TTGTOAAATA ATCACCTGAT TTAACTTTGT CATCAACTGT 120

AATTTGTGAT TTTAATGATA AATAATCTTG GGCTGGTACG ATTTTATTGT TTTTATCTGC 180

ATCAACGACA GTtAATGTTG TATTTGATGT GATTAAATCA TTAACATTTT TAGCCTCTGT 240

TGATGATGGC TGTACTGCTG CTATACGCAT TCTTGTATTC AAACGTTTAG GTGCTGTACT 300

TTTTGGCAAA ATGATATCTG CATTATTTTC ATTATTTGAA TTACTATTGT TATCAACAAG 360

AGTTTCATCA TTACTCTTGA TAGCATCACT TTTAACATTT AATGTAGTTG ATTCAGTTTT 420

GGCATCTACC TTTTTGTTTT CCTCATTAGT TGGTTGAACA TTTACCACTG ATTTATTCTC 480

TTGCAAATCA GGTTGTAACG CTTCTTGATT ACTTATAGTT TGTTTAGTGT TTAAATCTTC 540

ATTCGTAGAT TTTGGTGAAO CTTGCTCATC TGATTTGGCA GTTGAAACTT CAACTTTATT 600

TCCAGTGGTA GATTGTACAC TTTCTTTTTC TATTAATTTA TTCCCATTTG AAGTCGTTTC 660

ATTACCTTGa GATGATACCA TTTCTTTTTG ATTATCATTT TTAGTATTGT CTTCTTGATT 720

TAGTTGCTGC ATATCAACTT TATCACTCGA TTGATTATCA CTTGCTGAAG TTGTCGCTcG 780

TTCAATTCTT TATTAGTACT TTCTGCAGCC TTTGCTTCTT GGTTCCCCAG ACCAAAAATT 840

AATGTTGTAC CTACTAAAAT TGATGCTGTT CCCACTGTGT ACTTTCTAAT CGAAAATTTA 900

TTTAATCGAT TGGATACCAT GCCTTTCCTT GTTATTGCCG TTTTATTTTC TCTGTTTAGC 960

ATTAGATTAC TCCTAATTCA TCAAATTTTT AAATAATACA ATTGTTTTAA ATACAAAAAT 1020

GTATATCAAT ATAGTATTAC ATTTTTAGAT AAAGCACAAT ACTTTAATTA TTTTTCTTTA 1080

TCGTAAAACG TTATTTAACA TTTGTGTTTA AATAAAAGTT TTTATGAGTt TTGTAATCTT 1140

TATTTAATCA TCATAAAAAA TAGTATTATT TGCCCTTGAA ATTAATATCT TAGCTTTTCT 1200

AATTCATAGA CAATTACATT TCTGTAACAA ATTAAATTGT ATCTATTCCT TAAAGATTTT 1260

TTGTTTTATA TCTGGGAATT TCTAAACAGA AAAAACCAGG CCACATGGAC CTGGTTAAGT 1320

TAATCATATT ATTTATTTTG TTTTTTACGA CGACCGAATA ACAATAATGA TCCTAATGCC 1380

GCGAATAATC CACCGAATAA TGTGCCATTA TTTGAATTAT TATTTTCACT ACCTGTTTCT 1440

GGTAATGCTT TAGCTGTTTT ATGCTGATCT TTAACCGTAC TCATTGGTTT AGCCGGAGTA 1500

TGTTTACCTG CATCTGAATC TGAATCGCTA TCTGAATCTG AGTCGTTGTC TGAGTCCGAA 1560

TCGCTATCTG AATCTGAGTC GCTGTCTGAA TCTGAATCGC TATCCGAGTC TGAGTCGCTA 1620

TCTGAGTCTG AGTCGCTATC TGAATCTGAA TCGCTGTCTG AGTCTGAATC GCTATCTGAG 1680

TCTGAATCGC TGTCCGAATC TGAGTCGCTA TCTGAATCTG AATCGCTATC TGAATCTGAG 1740
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TCTGAATCTG AGTCGCTGTC TGAATCTGAA TCACTGTCTG AGTCTGAGTC GCTGTCTGAG 1860

TCTGAATCGC TGTCAGAATC TGAGTCGCTA TCTGAGTCTG AATCTGAATC ACTGTCTGAG 1920

TCCGAATCGC TATCTGAATC TGAATCGCTA TCTGAGTCTG AGTCGCTATC CGAATCTGAG 1980

TCGCTATCTG AGTCTGAGTC GCTATCCGAG TCTGAATCGC TGTCTGAGTC TGAGTCGCTG 2040

TCTGAATCTG AATCGCTATC TGAGTCTGAG TCGCTGTCTG AATCGCTGTC TGAATCTGAG 2100

TCGCTATCTG AATCTGAGTC GCTATCTGAG TCTGAATCGC TGTCAGAATC TGAGTCGCTA 2160

TCTGATGTTT CTTCTTCGTA GTAGCCATTA TCAAGTGTGA AATCATCATG ATCCGTAATT 2220

GTTACATCAA CTTCGCCACC ATCGGCATCT TTATCATCTT CAGTTGTATT TGTACCTGTT 2280

TGAGTTAAGC CAGCAGGTTT TTCAAAGATA ACTTTGTATT TACCACTATC TAAATTATCA 2340

AAGCGGTATT TACCATTTTC ATCTGTyTCA GTTGTACCAA TTACTTCGCC TTTTTCGTTT 2400

TGCAAAGTAA CTTTAACACC TTTAATTCCT TTTTCAGTCG AATCTTGTTT ACCATCTTTA 2460

TTACTGTCGT ACCAAACATA ATCACCTAAA CTATATTTTG GTGTTTTGTA GAATCCACTA 2520

TCTAATGTCA TGTTGTCAGC GTCTTTAATG ACACCTGTTG TAGTTAGTCC ATCAGAATCT 2580

ACAGCATCAT CTGTACCTAC ATTTGCAGTT GTCGGTGTAT AACOGGCTGG TGTTGAAAAC 2640

TCTACACTAT AAGTTCCATT GCTTAAACCA GTGAACTGAT ATTTACCATT TTCATCTGTT 2700

GTCGTACGAT CTAATTCTTT ACCGTTACTA TCTTTAAGAA TGACATAAAC ACCTTTAATC 2760

CCTTTTTCAT TGGCATCTTG TTTACCATCT TTATTTGTAT CTTCCCATAC ATAGTCACCT 2820

AGATTATATT TCTTTTGGTC GCCATTAGCA GTTGATGAGC CATTCACATT TGAATAACTA 2880

TTTGACCAAC TATATTTAGT TTTGTCAGTG TCTAAAGTAT AATCAATTTT TCCATTATCT 2940

GTTGAACTAT TATCTGGATA AGCAACTTGT TGAATGATGT ATTGTTTATT GCTGCTTGTT 3000

TGGCCTTTCA TTAAATCGAC TGTAGCTGTT TTATTATCAT TACTATAAAT AACATCGAAT 3060

TGATCAGTAA CATCTTTAAG TTTTGAAGTA TCAGGGGTGA AACTATCCAC AAATTGATTT 3120

TGATCTGtCA CTtCGTAAAT TTTGAAGTTT TTTGCATTTG GATTAAATTT ATATCCAGTT 3180

AAATTAGTAA CAAACGTTTG TTTAGTATAT GTATTTTTAG GTTGATTTAC ATATGCAGTC 3240

ATATTACGCG ATAAATCTTC ATTGTTAATA TAGTTTGTAC TTGAAATAAG CGGTTGTGCT 3300

TTTTTATTAC CATAATCGAC AATGATTTCT TCGCTATATG TATCATTACC TAAAGTTACT 3360

TCCATTTTAT AAGCTGTTTT ATCAGTTOTT GCATTTTTAC GTTTCGCAAA TGCAACTTGT 3420

TCAAAGCTAC CTCTAACATT TGTATATTGA TCTACATAGT TCGTAAAAGT ATATGTTGTT 3480

GTGTTTGTTG TACTATCATA AATACCTTTT GCAATAATAT TACCTTGGGC ATTATATAAA 3540
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GTAAATGTAT CQCCCTCTTT AACAGAATCA TCOATTGTGT AATTTGCTTT TAATTTTAAA 3660 

ACATCACTTG AAGTTGCCCA AAATTCAGTT TTACCAGTAG TCTGATTAAC ATGTCCTTTA 3720 

TCAATCGCAA TGTCAATATT TGAAAAATGT ACTTTATCAT TAACATTTGT TCCTTGTTGT 3780

GGAGCTGCAA CAGTATTCAC TGCCATGCGA TTTAAAGTTC TTGGTTTAAT AGTCOTTGTT 3840 

TTAGGTGTAG TTGAAACATC TTTTGCTTGT GTTAAATTAC TTTTATCAGT TTCATTACTA 3900 

TATGTAGTTG ATGATTTATC ATTTGTTGTT ACATTGCTAG TTTTTGTAGT AGATTGATTA 3960

GCTGTAGCGT TTTGTGGTGA TTGCATGTTA CTACTAGTTT CTTTAACTGT TGCACTATCA 4020

CTCATTGTCA CTTTAGGCTG ATCTGCAGTT GCAGTTTGCG TATTGTCTTT TAGTTGACGA 4080

CTATCAACTT TTTTAGTTGT TTTATTCTCA CTTGGGGCTG TCGTTTCATT TTTTGATTGA 4140 

TTTAATTCTC CATTCGTATG TTCTGCCGCT TTAGCTTCAT GACCACTTAA CCCAAAAATC 4200 

AATGTTGTCC CTACTAAAAT TGAAGCAGTA CCTACAGAAT ACTTTCTTAT CGAAAATTTG 4260

TTTAATCGAT TTGGTATCAT GCCTTTnCTA TTTGTnGCTG TCTTTTTATA ATTCATTTAA 4320 

TAATACTCCT TTAAAATATC AAAATTTGAT AAATATAA 4358 

(2) INFORMATION FOR SEQ ID NO: 455:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1060 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455: 

TTGACTTCTT AATTCAGCAT TTTCTGCACT TAATGCTTTG TTCTTTTTAA TAAGTTGCTT 60

TCTTGCATAA ACTTCGGTAT CTATTTTACT ATTACTATAC CTTTGATTTA AAACTAATAT 120

ACCAATTAAT GCTACAATGA TAATGATAAG TACAACATAA AAAGACATTT TTTCACCAAT 180

CCTTTTTGAC TTCTTTAACT TTGTATACAA TAATAATTAA TAAAGATTAA TTGTTATTCA 240

ATTTCCCACA TTTTTATTAG TTGATTTTAG TTCATCATTG TTATAATCAA ATTATAAACT 300

GACAGATATT GATGTTCAAT GAATATGACG TGAAAGATTC GTGAATTCAA GTTTATGTCG 360

AATTTATGTT ATAACGGTCA TTTAAATGAC AGAATTAGGT CACTCATAGT ATTTTGAAGA 420

TTGAATTCAT TAATTTTAAA ATGTATAATG ATATTTGTGA AAGCGCTTGC TTAGGAGGTG 480

TATTTGAGAG TGAATGAAAT GAATGCTAAA GAACAATTAG TGGACAATTT AATGAAAACA 540

TCATCGCAAT TATTTAAATT TCACGGTGAA GTTGCCATGC AGCTTTTCTT AAATGATGAA 600
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AAAGTTATTC CGCAATCATA TGCGTTACTA TACATAGATA AGCAAGATCA AGCAATAGCT 720 

AAAGAAGATT TATCACTTTC AAAAATTGCA AAAGTTTATG TGCAATATGA TGATACAACA 780 

ATAATGAGTA TTTTCGTTTA TGATGTAGTA AACGATGAAT GGATTTTTAG ATTGGATCCG 840

AATATACGTA TACCTAAGAG TAACATATAC TTCCATAGTT TAAATTGGGA TGTGGATATA 900 

TTAAACCGGA GtCGTCTAAT GTATGTCTAA TGCaCACCAT CAGaTCATCA TtATCCATTA 960 

TAAcGrGCAT AGTGCATAAn yACTwCAtTT TaTTaaATTG AGaGGgGCAC GATAGGTGCA 1020

TCAGGACATA ATATAGGAAG CATCAACGCG TGAnCAGGTC 1060 



(2) INFORMATION FOR SEQ ID NO: 456: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1262 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456: 



ATAATnACTA AATACnAAAG TTTAACTGTC TTACTAATAA TGACTATGTT ATAATTTTAA 60

AAGTGATATT TTGGGTAATC GCTATATTAT ATAGAGGAAA GTCCATGCTC ACACAGTCTG 120

AGATGATTGT AGTGTTCGTG CTTGATGAAA CAATAAATCA AGGCATTAAT TTGACGGCAA 180

TGAAATATCC TAAGTCTTTC GATATGGATA GAGTAATTTG AAAGTGCCAC AGTGACGTAG 240

CTTTTATAGA AATATAAAAG GTGGAACGCG GTAAACCCCT CGAGTGAGCA ATCCAAATTT 300

GGTAGGAGCA CTTGTTTAAC GGAATTCAAC GTATAAACGA GACACACTTC GCGAAATGAA 360

GTGGTGTAGA CAGATGGTTA TCACCTGAGT ACCAGTGTGA CTAGTGCACG TGATGAGTAC 420

GATGGAACAG AACATGGCTT ATAGAAATAT CACTACTAGT TTAGCTCTCC TAGATGATGG 480

AGAGCTTTTT TCATGAAAAG AACACTTAAA ATTAACGCCy TGTCTTGaTA tAATGACaCT 540

GCcTTGTTTT AAAATAGTAA GCGGATGCgT TAATGTATCA GCGATTAAAT TTGTTGGAAA 600

TGTATAAAAA ACACAAGCTA AGAATAAAAT ACCTGTATAA AAGGAGAATC ATATATGTTT 660

CAATTACTTG CAGTTTGTCC GATGGGATTA GAAGCTGTTG TTGCTAGGGA AATTCAAGAA 720

TTAGGCTATG AAACAAATGT TGAAAATGGT CGTATATTTT TTGAaGGAGA CGCAAGTGCA 780

ATTGTAAAGG CAAATTTATG GTTGCGCACA GCAGACCGAA TCAAAaTTGT TGTTGGACGT 840

TTTAACGCAA CAACGTTTGA CGAATTATTC GAACAAACCA AAGCGCTCCC TTGGGAATCT 900

ATAATTGATA AAGAGGGTAA CTTCCCAGTT CAAGGTAGAA GCGTTAAATC AACACTACAT 960
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TATAACGAAA AAOQTTOGTT AAATGAATCA GGTGCCAAAT ACCCTGTTOA AGTTGCCATT 1080 

TTAAAAGATA ATGTATTATT GACTATCGAC ACATCAGGTT CTGGTTTGAA CAGACGTGGT 1140

TATAGATTAG CACAAGGTGA AGCACCAATT AAAGAAACGT TGGCAGCAAO TTTAATCCGT 1200 

CTTGCCAAAC TGGGAAAGGT GATTACACCT TTnAATTGGT CCCATTTGCG GTTCnGGTTA 1260 

CA 1262 



(2) INFORMATION FOR SEQ ID NO: 457: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1142 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS : double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457:





CCTGGCTGCT TTATCAGCAT CTACTACTTT AAAACCGAAT ACGGATAAGA GTTCTGATAC 60

TGTTGATTTT CCTGAGGCGA TTCCACCTGT TAGACCAATA ACTTTCGGCA TAATTTCACT 120

CTTTCTTTAT TTTTGACATA CTGGACAATA ATGACTATTT CTTGTCGCGA TGATTTTTGT 180

TTCAATTTGA CTTCCACACA CTTTGCATAC CGGCTGCTTA TATACATTAA GATGCAATTG 240

CATCTCACCA GTTTTTCCAT CAGCATGACG ATAATCTGAA ATACTTGTAC CGCCATATTT 300

AATACCTTCT TCTAGTACTT CTCTAACATA ATAAAAAACC ATTTCTTGTT GTTGGTGTGT 360

TAAGTCITTT acitttttat CTGGTAAAAC ACCTGCACGA AACAACGCTT CACATGCGTA 420

AATATTTCCA CAACCTGCGA TTACTTTATG ATCCAAAATC ACTTGTTTGA TTGGTTTATT 480

CTTATTAGAC TGTTGATGAA TTCGATTTAA ATAATACGTC AATGCTTCAT TTGAAAAAGG 540

TTCAGGCGCT ATTTCTAAAA ATGAAGGATA AGATGCTACA GACGCAACAT TTCTAATTTC 600

TCCAAAACGA CGTATATCTG AATAAATTAA CTTTTTGTCA TTTGACAACT CAAAAATAAC 660

ATGCCAATGC TTACGATAAT TAGGTATCAT AATATCTTCA AGTTCATCTA CAATGAAAAA 720

ACCGCCCGCC ATACCTAAAT GACTAATTAA TGTACGTTGT TCTCGTTTAT TATCTAGCTG 780

AAAAACGATA TATTTACTTC TTCGTTCTAC ATTTGTAATG GTATAGCCTT CCGATAAAGT 840

TTTAAAAGTA TCTAATTCAA TTCCTTTTAT AATTGTTTCC TTGCCTTGAG CTTTACCTTC 900

GATTACTTTA TCCGAAAATA TAACGTGTTC AATTTTTTGA TTTATAACGT AGGGTTCAAT 960

TCCTCTTTTT ACATGTTCTA CTTCTGGTAA TTCGGGCATA CCATTAACCT CACTTTATTT 1020

TGCATCATAC CAGGTTGCAC CATAACTTGA GTCTACTTTT AATGGAACAT CTArTTGCAA 1080
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TT 1142 
(2) INFORMATION FOR SEQ ID NO: 458: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1814 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458:



CCTTTAGTAA ACAATCCTTC TTTAGTTTTA GTACGTTGTT CCAATCCAAA TAATTTGTAT 60

TTCATTGCCT CGCCCGATTG AGTGCCGCTA AAGTTATCAT CTTTCATGTT AGGCGTGTTG 120

GTAAACATGT GTATATCACT GTTTAAACGG TCTTTATAAG CTTCGGTACC TTGTACATCG 180

TATTGCTTAT AAATATAACC ACCATCAACA GAGCCTTCTG TTTCTCTACC TTCGCTATCA 240

GCATAAACAG TCGGTTCTAA AAACAACACG TTAGCTTCCT TTTGTTTTCT AACTTCTACA 300

GGATCTAAAT TTAAATTACC TTTAATAAGT AACATAGCGT CATTTAAATC ACTCATATAO 360

TTAGCAsymy CTGATTCAGC ATTATCATAC AAATCAATTA AAGTGATTAC TTTCTCATAA 420

TCCCCTTTTC TTCTTTCGTT GTTGCTAAAT TCTGTAATAG GCATACGTTC GAAAGAGTGT 480

GATTCAAAAC CGTTTTCACG TGGTGTGAGC TTCAATCCAT TTGTTCTACT GGTAAGATAT 540

CTATAAACAC CGTGTGAAGT GAATAAATCA ACTGTAAACA CTTCATCTTC GTCAGTCTTG 600

TCTATTGGTT TAGTTCTTAA ATATCTAACG CCTGCGATAC TATTACGTTC AATTGTATTG 660

TCGTATATGA CAAAAGTACT CATTGCATCA CTCTTGTATA AACGCGTTTC ATCATCTTGG 720

TTTCTAATCA TTAACTCATA AGCTTTGCCA TAAATTGACA AATCTAATCC TAAAGATCTA 780

TTGTGTGACT CAACATCATT TAAATCATTG AACGCCTCAA TAACTTCTAA TACATCTTTG 840

TCATCATCTT GATATTGAAT TGGATTACCC AAGAAATAGC CGTTGATAAA ATCGCTAATA 900

TAAGATGCGT AATCATGCGC TACACGGTTA TCTGCCATGT            GCGTCGTGTT 960

AACTCAACTA AGTTCTTAGT TTTACCTTCG TAATAATCAC TTAACACTTT CAATCTAGGT 1020

CGTTGGTAAT CCATGTGATG TTCAATGTAT TTACTTACTT CATTAACGTT TTGTAATAAA 1080

TCGGATTCCG TCCCGTCATA TGTGTAAACA ACATTGGCTT CATCATTAAA TAAGTAATTT 1140

ATGTTTCCCC GTAGATCTGT ATCTGTTTCA AATTCGTTTA CTTTTAACAT TTGTTCCCTC 1200

CTATAATCCT AGAGATTTTA TTGTGTCAAC TTTCGAACTG AGATTTGTGC GTTTtCTAAC 1260

CGGTCTGTAG AATCGTTCCA CTGAATAACG CAACGAATCG ATACAATGAT TGTATGTATC 1320
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CTCTTCAATA GTCTTGAAAC AACGTTCATC AACAATGATT TCAAATTGCA TTAAGAATTG 1440 

TAACCCTTGT ACAACCGAGC CCTTCCCTTT TTTGGTTGGT AAAATCCTTT TAAGCCCTAG 1500 

ATTCCTTAAT TCAGCTATAC TTTTTTGTTC TGCACTATCT GCTGTAATTT CTTCTTTAGC 1560

ATAACCAAGT TGCTTTATGA CATTAGCTAT TTCATCATTC AGCATACCTT GTTTAACATA 1620 

CTCTTCAATG ATGTATAACT TCTTTTTCTT TACATCTATT TTAGAATGTA TAAAAGCACT 1680

AGGATCATTA ACGTAGCCAA AGTCCAATCC AAAATAAGAA GGTAAATGTC TTAACTCATC 1740

TTTATTTATT AAACGTTTTT CATACTTAGG GAAAACCAAT TTGTCTAGTG TAGCAAATTC 1800 

ACCTAACGCA TAAA 1814 
(2) INFORMATION FOR SEQ ID NO: 459:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 686 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459:

AATTnAGATT ATTACCCTCC TTTAAAAATA TTTGTTTCAC AATTTTTTAT TACCTATTTA 60 

CTGGTTTCAT GTCTTATGGG GCATTTTACT TGTTGGTATT TGAAAATGTG CAAAATTTAA 120

TCTTATATGT TTCTTGGCTT TTCATGACTA TGCTATTTAT GTTTATGAAT ATGCATTCAA 180

XTATAGATAA AAAAGTACAT ATATTCTTAA AGTCTAATAA ATAGTTACAA ATTTAGTTAG 240

TTTTCAATTG TTAATTAGGG GTGGTAAACA GTGCTTTGTG AATCTAGACA AATTTATAAA 300 

AATCCTAAAT ATCGAGTTAT TAGATATAAT AATGAATATT TCATGGTCGA TTTAGTAAGT 360

ACTTGGATTA CTTATTTTTT CCCTATGATT AATTGGTTTT TGCCCAAAAA ATACGCAAAA 420 

ATTAGCGAAA ATGAATTTGA AAGGTTAAAT ATAGTCGAGC CTGTTAAAAA TAATGTTTTT 480 

TGGCCGGTTG CAGGAAGTTC AGTTCTATTT GGAATTATAT TGAGAAAGTA CGGTAACTTC 540

TTTAATGTTC AGTTTGAAAA ACAACTAGCA ATCACTGTAT TTTTTATCAT GTTAATAGGG 600 

ATGTTAATTT TTTATTTTTA TCTAAATAAA AAATTAACAT TAAAAATTTT TAATACCAAC 660 

GTGGGTAATA AGAATAGGAG TTGTAT 686
(2) INFORMATION FOR SEQ ID NO: 460: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1300 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460: 






niVflvJWwii x 


X UwvV^tvAV. 




GCAAGTAACT AAAGGAGAAG 


l J"W— A x x inun 


60 


-1 Innn X 


AX X X InVlnA X 


T 2Vf5 & HfSTTVl 2k 


TGAAAGAGCA 


AAAGCAGGAT 


XaX X I* X xww 


120 




fPATfUfi & 21 2k 




TACAAATGCT 


GATTTCATGC 


v»X IWUjt^nnl 


1 RO 
X o u 


CAATGCZUAAA 


CGrGAAUAAG 


uALAAuAAAT 


CAACTTAATG 


CAATTTATTA 


TV TV T\*l"l'7\/^ 

AGAAA1 IAGA 


OA A 


TAAAAACATG GATTTTCTAG ACATAGATAA AGACATGGCA CAACGTTATT TAAATGAAGG 300

TTTCTCAGGT GGAGAGAAGA AACGTAACGA AATCTTACAA TTAATGATGT TAGAACCTAA 360

GTTTGCaATC TTAGATGAAA TCGATTCAGG GTTAGACATC GATGCATTAA AAGTTGTATC 420

TAAAGGTATT AACCAAATGC GTGGGGAAAA CTTTGGTGCA TTAATGATTA CACACTATCA 480

ACGATTATTA AATTACATTA CTCCTGATAA AGTACATGTA ATGTATGCTG GTAAAGTCGT 540

TAAATCTGGT GGTCCAGAAT TAGCAAAACG TCTTGAAGAA GAAGGATATG AATGGGTTAA 600

AGAAGAGTTC GGTTCAGCTG AATAATCTTA TTAATACAGT ATCCATGAGA TGTTCATCTA 660

TATATGATGA AAATGAACAT TTATACGAAA TAGTAAATTT CATCAAGTAG GAGGAAAAAG 720

TTATGACAAC TGATATTTTG rACaTTyCTG AAGAACAACT TGTTGATTAT TCTAAAGCCC 780

ACAATGAACC TTCTTGGATG ACAGAATTAC GTAAAAAAGC TTTGAAATTA ACAGAAACTT 840

TAGAAATGCC AAAACCTGAT AAAACAAAAT TAAGAAAATG GGATTTTGAT TCTTTTAAAC 900

AACACGATGT AAAAGGTGAT GTTTATCAAT CTTTATCACA ATTACCTGAG TCAGTAAGAG 960

AAATTATTGA CGTAGATCAT TCTAAAAACT TAGTAATTCA ACATAATAAT ACGATTGCGT 1020

ACACACAAGT TGATGATAAT GCATCGAAAG ATGGCGTTAT CGTTGAAGGT TTAGCAGACG 1080

CTCTTATGAA CCATAGTGAT TTAGTACAAA AGTACTTTAT GAAAGATGCA GTAACAGTAG 1140

ATGAACATCG TATCACAGCG CTACACACGG CATTAGTTAA TGGTGGOGTA TTTGTTTATG 1200

TTCCTAAAAA TGTAGTTGTA GAACATCCAG TACAATACGT TGTGTTGCAC GACGACGAAA 1260

ATGCAAGCTT TTATAACCAT GTTATCATCG TTACTGAAGA 1300



(2) INFORMATION FOR SEQ ID NO: 461:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3135 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear
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GACAGCAAAT TCAAGATACA TTAAATAAAG ATATTGTCAT AAAGCATATT CTTGTTCGAG 60 

ATAAATCTAA AAAGAGACCG CTAAATATTA GCCAATATCA TTTAACTGAA GATGTTAATG 120 

AAATTTTAAA TGATGATTCA TTAGATATTA TCGTTGAAGT CATGGGAGGA ATTGAACCAA 180 

CTGTAGATTG GTTAAGAACA GCACTTAAAA ATAAAAAACA TGTTATTACC GCAAATAAAG 240 

ATTTATTAGC AGTACATCTT AAACTTTTAG AAGATTTAGC AGAAGAAAAT GGTGTAGCTT 300 

TAAAGTTTGA AGCGAGTGTA GCAGGTGGTA TTCCGATCGT AAATGCCATA AATAATGGTT 360 

TGAATGCGAA TAATATTTCA AAATTTATGG GAATTTTAAA TGGTACCTCT AATTTTATTT 420 

TATCTAAAAT GACTAAAGAG CAAACGACAT TTGAGGAAGC ACTTGATGAA GCGAAAAGAC 480 

TTGGTTTTGC TGAAGCGGAT CCAACTGATG ATGTAGAAGG GGTAGATGCA GCGOGTAAAG 540 

TTGTCATTAC ATCATATTTA TCATTTAACC AAGTCATTAA ATTAAACGAC GTTAAACGAA 600 

GAGGAATTAG TGGCGTAACT TTAACTGATA TTAATGTAGC CGATCAACTG GGGTATAAAA 660 

TTAAATTGAT TGGTAAGGGA ATATATGAAA ATGGCAAAGT TAATGCATCG GTAGAACCAA 720 

CGTTAATTGA TAAAAAGCAT CAATTAGCAG CTGTAGAGGA TGAATATAAC GCGATTTATG 780 

TTATTGGTGA TGCCGTTGGT GACACGATGT TTTATGGAAA AGGAGCAGGC AGTTTAGCAA 840

CAGGTAGTGC CGTTGTCAGT GATTTATTGA ATGTAGCATT ATTCTTTGAA TCAGATTTAC 900 

ACACATTGCC ACCACATTTT GAATTAAAGA CAGATAAAAC ACGGGAAATG ATGGATTCAG 960

ATGCAGAAAT TAATATTAAA GAAAAATCCA ATTTCTTTGT AGTAGTGAAT CATGTCAAAG 1020

GTTCAATTGA AAATTTTGAA AATGAGTTAA AGGGAATATT ACCATTTCAC GGATGATTAA 1080 

GAGTTGCAAA TTACGATAAT CAATCATATG CCGCTGTTAT AGTTGGATTG GAATCATCAC 1140

CGGAAGAATT AATCACTAAG CATGGATACG AATTGACAAA GTATACCCAG TAGAAGGAGT 1200 

TTAATTATAA TGAGAAGATG GCAAGGATTA GTAGAAGAGT TTAAAGCACA TTTACCAGTA 1260 

AATGAAAATA CACCAAAATT AACATTGAAC GAGGGAAATA CACCACTCAT TCATTGTGAA 1320

AATATGTCTA AAATACTAGG CATAGATTTA TATGTGaAGT ATGAAGGTGC CAATCCGACA 1380 

GTTCATTTAA AGATCGCGGT ATGGTAATGG CTGTGACAAA AGCAAAAGAG CAAGGTAAGA 1440 

AAATTGTAAT ATGCGCTTCG ACTGGAAATA CATCAGCGTC TGCAGCAGCA TATGCAGCGA 1500 

GAGCAGGTTT AAAAGCTATC GTCGTAATAC CAGAAGGTAA AATTGCATTA GGTAAATTGT 1560 

CGCAAGCAGT AATGTATGGT GCAGAAATCG TTTCTATTGA AGGAAACTTT GATGAAGCTT 1620 

TAGAAATTGT AAAAGAAATT GCAAAAAGTG GCGAAATCGA GCTTGTAAAC TCTGTCAATC 1680

CATTTAGAAT CGAAGGACAA AAGACAGGCT CATTTGAAAT TGTACAACAA TTAGACGGTG 1740
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AAGGCTTTAA AGAATATGAT GAAGCTAAAG GATCACAATT GCCGAAAATG TTTGGCTTCC 1860

AAGCTGAAGG CjGCATCACCA ATTGTTCAAA ATAAAGTCAT TAAAAATCCT GAAACGATTG 1920

CAACTSCTAT XCGAAA A\aQT AATCGTGCTA GTTGGGATAA GGCGAC7TAAT GCTCTTAAAG 1980

AATCAAATCKi ATTAATAGAT AGTGTTACTG ATGATGAAAT TCTAGAAGCA TATCAGTTAA 2040

TGACAACTAA AGAAGGTGTC TTTAGTGAAC CAGCGAGTAA TGCTTCTATT GCAGGTTTAA 2100

TTAAATTGCA TAGACAAGGT AAATTACCTC AAGGTAAAAA AGTAGTTGCT ATTTTAACTG 2160

GTAATGGATT AAAAGATCCT GATACTGCTA TTTCACTACT AGATAATCCG ATAAAGCCAT 2220

TGCCAAATGA TAAAGATAGC ATTATCGATT ATATTAAAGG AGCTTTATAA CATGTCGAAT 2280

UTTTTIJGAGT TAACAATTCC TGCATCAACA GCCAACCTTG GAGTTGGCTT TGATTCTATA 2340

GGTAT06CTT TAOATAAATT TTTGCATCTG TCTGTAAAGG AAACATCAGG GACAAAATGG 2400

GAATATATTT TCCATGATGA TGCATCTAAG CAATTGCCTA CTGACGAAAC AAACTTTATT 2460

TATCATGTAG CACAACAAGT TGCTTCTAAA TATAGTGTTG ACTTGCCTAA TTTATGTATC 2520

GAAATGAGAA GTGATATTCC ATTGGCAAGA GGGTTAGGTT CGTCAGCTTC TGCTTTAGTA 2580

GGAGCTATAT ATATCGCAAA TTATTTTGGT GATATCCAAC TGTCTAAACA TGAGGTATTA 2640

CAATTAGCGA CTGAAATCGA AGOACATCCT GATAATGTTG OGCCGACCAT TTATGGTGGT 2700

TTAATCGCTG GATATTATAA TGATGTCTCG AAAGAAAGGT CaGTtGCACA TATCGACATA 2760

CCAGACGTGG ATGTGATTGT AACGATACCA ACTTATGAAC TAAAAACAGA AGCATCAAGA 2820

CGTGCTTTAC CACAAAAATT AACACATAGT GAAGCGGTTA AAAGTAGTGC AATTAGTAAT 2880

ACAATGATTT tGgCATTAGC ACAGCACAAT TATGAATTAG CAGGTAAACT CATGCAACAA 2940

GATGGCTTTC ATGAACCGTA TCGTCAGCAT TTAATTGCTG AATTTGATGA AGTGAAAACA 3000

ATTGCTAGTC AACATAATGC CTATGCAACT GTAATTAGTG GTGCTGGACC AACTATTTTA 3060

ATATTTAGTC GTAAAGAAAA TAGTGGGGAA TTGGTTCGCT CTTTAAATAG TCAGGTAGTA 3120

TCATGCCATT CTGAA 3135



(2) INFORMATION FOR SEQ ID NO: 462: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462:



EP0 786 519 A2 





AGnTCAATAT TTAGATCAAG CCGTTTTAAG TAATTACGAA CAAGTTTATA TCATTCATGG 120

TAAAGGTACA GGTGCACTTC AAAAAGGTGT ACAACAACAT TTGAAAAAGC ATAAAAGTGT 180

TAGTGACTTT AGAGGTGGTA TGCCAAGCGA AGGTGGATTT GGCGTTACCG TTGCAACACT 240

AAAATAAATT ATAATTTGAT AAATTAAATA GCTGCAGTTA AAATAATGTA AAGCAACAAG 300

AATACATTTC AAACATGTTA TTTGAAATAA GCATAAAAAT TGAGCAAATA GAAATACATG 360

AAGCATGTTA TCTGATATAA TTTGAACATC ATAATAATAA TTAAGGAGGA TTGGCATTTA 420

TGGCAATCGT AAAAGTAACA GATGCAGATT TTGATTCAAA AGTAGAATCT GGTGTACAAC 480

TAGTAGATTT TTGGGCAACA TGGTGTGGTC CATGTAAAAT GATCGCTCCG GTATTAGAAG 540

AATTAGCAGC TGACTATGAA GGTAAAGCTG ACATTTTAAA ATTAGATGTT GATGAAAATC 600

CATCAACTGC AGCTAAATAT GAAGTGATGA GTATTCCAAC ATTAATCGTC TTTAAAGACG 660

GTCAACCAGT TGATAAAGTT GTTGGTTTCC AACCAAAAGA AAACTTAGCT GAAGTTTTAG 720

ATAAACATTT ATAAGTTACA ACCAATGACG ACTGGGGCAT TTCTTTAATG AATTGCTCCA 780

GTTTTTGTTT GTGTTTTTAA TATAAAAAGT TGAATGATAA GTCATCATAT TGTTTACGAC 840

TTGAGAATGG TGGGATTAAT AAATCTATGA ACGTTAAATG ATAATCTAGC ATGCTGATAG 900

ATTTGTAGCA GTTGGTTTGA TAAAACCATG TTCAATATTA CATGATGTGC ATGAAAAGTC 960

ATACTCGAAG ATGTTGATTA TTAACTAGAA TTAGTGGTGA TAAATTTGAA GCACTTTTGT 1020

AGCATCATTC ATTTTAAAAT TAGAAGGGGG GATATTTTTG GAAGACTATA AGCAACGAAT 1080

TAAAAATAAA TTAAATGTCG TACCTATGGA ACCAGGATGC TATTTAATGA AAGATCGTAA 1140

TGATCAAGTG ATATATGTTG GCAAAGCTAA AAAGCTAAGA AATCGATTGC GATCATATTT 1200

TCACGGGTG 1209



(2) INFORMATION FOR SEQ ID NO: 463: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2410 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463: 
AGTTCAACAC GACGAATTTT ACCTGAGTTT GTTTTTGGTA AGTCGTCAAC GAATTCAATC 60 
TCTCTCGGAT ATTTATATGG TGCAACTTCA TTTTTAACAA ATTGTTGTAG TTCTTTAACT 120

AACGTATCAT CACCCGCAGT ATGGTCCTGT AAAATAACGA ATGCTTTAAC AATATTTCCT 180 

GCATCTTCAA CTTCAAAAGG CCCAATCGTA TAGCCTGAAC TAATAATAAT GTCATCTCGA 300 

CGTCCTTCAA ACCAGAAATA ACCATCATCA TCTACATGAG CTAAGTCACC AGTGATGTAG 360 

TATTtACCTG TTTGCGCTTT CGCCGTAOGT tCTGGCTCTT tATAATACCC TTTGAAAAGT 420 

GCTGGCAAAT CAAGTGGTAC TGCAATATTC CCTTTCGTAT TAGCAGGTAC GCTATTCCCC 480 

TCATCATCTA CTACAGTGAC CGAACTACCC GGAATGCCTT TACCCATTGA TCCAATCCTC 540 

TGTGGTGTAT CTTTTAAAAA GCCTATAAGC AAGGTACTTT CAGTCTGGCC ATATCCATCT 600 

CTTACAGTTA AATTAAAGTA TTTCTTGAAT TGTTCAACTA CTTCTCGATT TAGTGGCTCA 660 

CCTGCAOAAA CGGCACTATG TAAATGCGTT AAGTCATAAT CATTTAAGTT CTGTAATTTA 720 

GCCATCATAC GATATTCTGT CGGTGTACAA CATAAAACAT TAATTTGATA TTTTTGAAGC 780 

AATTCTAAGT ATGTTTCAGG ACTGAACCTT CCATTAAATA CAAAAGCAGT TGCACCTGAA 840 

CCTAATACAG ATAAGAAAGG ACTCCATACC CATTTTTGCC AACCTGGTGc TGCTGTTGCC 900

CAAACTAAGT CATCTTCATT aATACATaAC CAATGTTTTG GTGCCATTTG TaAATGTGcA 960 

AATCCCCaTC CATGACAATG TGTAACGGCT TTAGGATTGC CAGTTGTACC AGATGTATAT 1020 

GACAGAATCG CCATATCATC ACGCGTCGTA TCTGCCATTT CTAGTTTGTT ACTTGCGTTT 1080

TCTTTTTCAG CTTCAAGTGA AATCCATCCA TCTTTTTGAC CGGCAATAAC AAATTTAGTT 1140 

AACGCATCAT ATTCTTTAAT TTTTTCAAAT TCAACTGTGA ATGGCTCTAG TGCAATAACT 1200 

GCATTAATTT CACCATGTGT GATACGGTAT TGTAAATCTT TAGTTCTTAG CATTTCAGAA 1260 

CATGGAATGA TTGCAACACC TAATTTTAAA GCAGCAATAT ATAATTCATA CGTCGCAATA 1320 

GATCGTGGCA TCATAATGAG TACTTTATCG CCTTTAGATA AACCGTGCGA TGCTAAAACA 1380 

TTACCTACTT TATTAGACTG TTCAATGAGC TGTTGGTAAG TGACTGATAT ATCTTCGCCT 1440 

TCAGTATTAT GATATAAAAT TGCCTTTTTA TCTGGTATGT GGCTATATTT TTCGATTTCC 1500 

GAAATAATGT TATATTTTTC AGGCGCGAAT AGAGCTGACT TTTGCATAAC TAACTTCCTT 1560 

TCATACATCC ACTTTTCCTG TGATGAACAT TGTAATTTTA TAAATGAATT ATATACATCA 1620 

TACGCCTATC TTTACAGAAT TTTCAATTAA ATAGGGTTAA ATACCAAAGT CCTCGACACT 1680 

ACACTTTGAC ATGACGTAGC ATTCAAGGAC TTTCAAATGA TTGAGGGTTG ATATCTCGGG 1740

CTAGACCATA TCAGCTAATT CAATACGAAT ATTGTATGAT AATTCACGAT TAATTATTTT 1800 

TACATCTGCA CCTTTCGAAG TGCCACGATG CTTGTGTGTA TGCTTGTACT CAGCTGAATT 1860 

TTGCCAATGA TAAAATGCTT GCCTATTTTC CCACAGCGTA ATAATGATAT AGTGTCTACC 1920

AGCTGTTCTA GGTCTTAAAA ACCTTAATGC TTTAAATCCA TCAACGTTTT TTAAATGCTT 1980 



ATTTAACACA CATAATGAAT CATTTOATAA ATCATTTATC OCTTCTAOCA CATCGTAATA 2100
TGCAGTGTCA TTATTTTTTT GTATTGTGAG ACAATCATCC AGTTCTTCTA TTACATAACT 2160
TCTATATTCA TCATAAATTT TCATAATAAA TGCCTTCATT TCATTTATAT TTTTGGTCAT 2220
ATTACTkTAT ATCTATTACT AAtkCATTCC CGTATTTATT AATTACAATC ATAGTTTGGC 2280
TyCTTTTTAA AAGATAAGAC TTTGTAAAAA GTATTAATAT TTCATGCAAA TGGGGGACAG 2340
GAGTCGCCCA CTATTTTTGT GTCTTCAATT TCATGATCAT TATTTAACAT TAGTCATGAA 2400
AATAGCCGAC 2410

(2) INFORMATION FOR SEQ ID NO: 464: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 590 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464:
TTTATTAATT GTAAAAAATT GAGTAAATTA TCTTTACATT CTAAATTAGT CTTAACTACA 60
ACTAGTATCC TAATAATTAT AGGAGCTATT ACATTCTTTT TATTAGAACA GTTTAATACT 120
ATGCAACATA TGGGACTAGT TGAAAAAATC GGAAATTCTT TTTTCCAATC AGTAACAACA 180
CGAACAGCGG GTTTTAACAG TATAGATATA GCAAGCATTA rCAAATCTAC CGCATtAaTG 240
TTAATGCTAC TTATGTTTAT TGGTGGTGCC CCTCTCAGTG CAGCTGGAGG AATTAAAATA 300
ACTACTTTTG CAGTTGCGTT TATTTTTGTA CTAAATTATA kACGTAAAGA AAATAATGTT 360
TCAGTATTCA ATAAAGAAAT ATCTGACAAA CATATAAAAC TATCTATTGT TACCATTAAT 420
ATCTCATTTC TATTTATCAG CATCATTACT TTTATATTAT CGATAATTAA TCCGAACATA 480
TCATTAATCA AGTTATTATT CGAAGTGGTT TCTGCATTCG GAACAGTAGG GTTAAGTATG 540
AACCTTACCA CAGAATATCA TGGTATTACT AAAATAATTA TTATATTCGT 590
(2) INFORMATION FOR SEQ ID NO: 465: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 905 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465:
TATTCQATTT GAnTCTTTTA AAATTATGTT TAAAAACATG TCTAATGATT CCGCATCATT 120

TTTTGCACCA TCAATAAGCG TTTCAGCAAA CCCCTTAATT GAAGTAATAG GTGTTTTTAA 180

TTCATGTGAA ACATTTGCTA CAAATTCAOG TCTTAGATTT TCAAGTTGTT TCAGATTTGT 240

TATATCATGC ATCACaACTA AAATCCCtTG CAAACTTTTT TGAGACCTAG TTAAAATCGG 300

AACGCATOAA ATATCAAAGT ACTTGGCATG GACTTGGTTT ATTGCAACTT CCAATTGTTC 360

ATAAATAGGT TTTTCAACTT TAAAACTTTC TAAAATTAAT TGCTCAATTT CAGTATTAAC 420

ATAGCCGTGA TAGCCTACTT GTTCAATATT ATGCGAGATG TTGAACTGTT CATAATAOGC 480

TTTATTTGCA ACAACGATTT TTCCATTTCG ATCTATCATT AAAATAGCAC TTGGAATATT 540

TTCAATCGTT GTTTTTAAAC GGTTGGATTG AATTTTTTGC TCATTATTAA Gcrrri'GAAG 600

GCGTCGTGCT AAATCATTGG TAGACACAAA AAGCGCTTTA GTTTCTACAA CATTACTTTC 660

AGGTACACGT ATGTGATAAT AACCATTTGC CAACAATTGT GTTGCATAAG TAACTTCTTG 720

AATGGGACGG ATTAATGTAC GCTTAAAACT ACGGCTTGCA AAATACAGAC AAATGAGTAC 780

AACTAAACAT GTCAAAATAA GATATTTCCA CAACGTCCAA TGCATTTCTG TAATATCGTT 840

ATTGTAACCT TTAATCCATA CATGATAACC GTTAACCTTC TTATtAAAAA TAAAAACGTC 900

CCTTT 905



(2) INFORMATION FOR SEQ ID NO: 466: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1016 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466: 



TTTGGTTAGC CCATAAAAGA AAAAAACAAG TAGTCATTTT TAAACAACAT ATCAAGTCTA 60

CCCAAGAAAT ACGTTTTGAC AAAGCGAAAG TGCTTGAACA CAAAGATGAA ATAGCAAATT 120

TTATTTCTTT CGAACCACAA AGTTTTGAAT TTTATTATTT TACAGAATCG GAATTTTCAG 180

AAGAACAATT AAATGAAGTT TCGCCAATTA GAATTAAATT CAATGTTATA AGACACACAA 240

AAGATTTGaT AAAGCATATG CCGAATATAT TTTTGGcTAG ACTTATTTCA GAAGATAATG 300

ATAAAAAGAC ATATATGTTC TATAAACGCA AAGTATTAAC CGATAACTTT TTAGATAAAT 360

ATATGCAGAA ATTTTCACCG GCAACATACA CAATAATATT TGTAAATGTC TTAATATGGT 420

TATGTATGAT TTTATATTTA AATAATTTTT CGGATGTAAA ATTATTAGAT GTTGGCGGGT 480
ATTTTAGTTT TGAACATATA CTTATGAATA TGCTTTCATT ATTTATTTTT GGTAAAATAO 600
TCOAAGCAAT TATTGGTTCA TGGCGGATGT TAACTGTATA CTTTATTGCA OGGTTGTTTG 660
GAAACTTTGT ATCACTATCA TTTAATACGA CTACAATTTC AGTTGGGGCT AGTGGTGCTA 720
TATTTGGTCT GATTGGATCA ATTTTTGCGA TGATGTATGT TTCAAAAACA TTTAACAAAA 780
AAATGTTAGG ACAGTTATTA ATTGCATTAG TGATATTAGT TGGTGTTTCT CTGTTTATGT 840
CAAATATAAA TATTGTGGCG CATATTGGAG GATTCATTGG TGGTTTATTA ATAACTTTAA 900
TTGGCTATTA CTATAAAGTG AATCGtAATA TTTTTTGGaT TTaCtAATTG GTATGCtTGt 960
tATATTTAwT GCACyTCmAA TTAgAtTTTT ACmATTAAAG AAGATAATAw TTATAA 1016
(2) INFORMATION FOR SEQ ID NO: 467:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 406 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467:
AACTTTAAAT TTAGACATCT TTAAAACCTC TCTTAAACCA TGCCTATATC TCAAGATGAT 60
ATTTCAAATG AACAATACTA TTGCTTGAGA CCATTAATGA ATGATCATAA ATATTTCTTT 120
CTATAAAATT AGCTTTCCAA TAACTGTGTT GTTGCATAAT ATCATTCACA AGTACACCAT 180
TTTCGGAAGT ATGATTATCT TTATCTATAC TTAAAACAAT TTGTTTAGTT TTAGCATGGC 240
TAAATTGTTG AAGACCCTTA CACGATAAAC GTATAGCGTC TGAATTCTCA TTTAACAATG 300
CGGCTGGgCA AACAAATGAC ACATTGTACT TCATGTTTGA ACtTCGTTAC AATCaTCGTG 360
kCatTTTGat AAATAACAAT CCCTCGTAAT kGATTAAGTA TATTAT 406
(2) INFORMATION FOR SEQ ID NO: 468:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468: 
AGATAATACA CTTGAAGTTG GAATGGTTTG TGAOGGTTAT TTAATGCGAA TTGAAAACTT 60

AACACCATCA AATTTCTTCA ACTCAGCAAG TGAAGATACG ATTACTAAAA TTAAATTAAA 120

AGGTACAGCG TTAAAACTAA GAGAAGCCAT CAATTATGAT GAAATGGTTA TTGTAGATAG 240

TATGACGTAG TTCCTAATTA TGCkAAAAGG GATTGATGAA AAACTGAAGG GCTTTTCATC 300

AATCCCTTTT ATTTTAGGGG AATTGAATAQ ATAGTTTTAA ACTATACGAA TTATTAATAT 360

TTGAGATTTA ATTGAAATAA GTTTTAAAAA TTGGAGGAGA TAGATTAAGC GAAGTCATTT 420

AAAGGTGAAG TTAAGTGTAT TCACAAAAAn TAGCCACACT CATATGACAT CGGATGAGTG 480

TGGCTTAAGG ATCTATGGGG GGAGGAAnCC ATAGATGTTT ACTTTGATAG GCCAGATTAA 540

ATATCAAAGT ATGCGATTAT TTATAGCTTG ATGCAAAAGT GGTATGCCTA TTAAAAGTTA 600

CTGCACATAG CTTTTAATAT TCCGTTCAAA GGAAAGGGGC ATACAATTGA ACAATCTGTA 660

ATAGTACTTT TAACCAGCTA TGCTAAAAGT CTAGTAGGGA GAACAGTTGT CCAATCACAT 720

AAGAACCTCT AACTTCGTTA GTACGATTAA GAAAAGCTTT TTAGTTAGTA TGTAATACAA 780

TTTATTGACG CGCGTGAATC TCTTTTATAA GAGTGTGTAG GGAATGGCGT TGTATAAATT 840

GTATTAGAAG AACTTCTAAC GCATCTCTGT GGTTAAAAGA GATGAAGGGA ACGACAGTTT 900

aATTAAAACT GCATAAGAAC TTCTAGCTTT TCTCTCTCGT TCAAAGAGAA GCAGcTGTTC 960

GCAGTTTAAT CAAAACCACA TAAAGCTTTT AACTTTACTC TTTGATTTAA AGAGTGATAA 1020

ATGTTTACAG TTTAATTAAA ACTGCATAAG AACTTCTAGC TTTTCTCTTT CGTTCAAgAG 1080

AAGCAGCTGT TCGCAgTTTA ATGAAAACCA CATAAAGCTT TTAACTTTAC TCTTTGATTT 1140

AAAGAGTGAC AAATGTTTAC AGTTTAATTA AAACTGCATA AGAACTTCTA GCTTTTCTCT 1200

TTCGTTCAAA GAGAAGTTCT AATACCACCA TATCGTGCGA TCGGGAACGG TATATATATT 1260

AATAGGAGGG TAATATATAT TTAACGCACG ATATGGGACT ATTAGCCTTC GACTTTGTTA 1320

TGTTGATGTG TGGCCTAAAA TATTGGAGAT ACCAATATTT TAGGTTGCAT CAACATCA 1378

(2) INFORMATION FOR SEQ ID NO: 469: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4171 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 469:

TCCCAACCAA TAATCGTGGC AAAAATACGG ATATTGGTAT GGCTTAACAA ATTGCAAATA 60 

TCGTTTAATC ATACATCCCC CCTAATCTAT TGCCCTATCC TATTCaTAAG CATAAAAATG 120

AATAGAGGTT GGATACATAA TTTGTAGATG TAAATTCTTC TTACAATTTA CATTTTTAAA 180 
TTATATTCTA TTCAATTTAA TCTATGGATA CTGTGTCCCC ACACGACAGC AAAAGTTATC 300

ATACTTCTTT ACATCACTAA GTCAATATAA ATGATTTAAT CAGTATTTAC ACTTTATTTG 360 

CTTAATACTG TCTAATTTTT TTGTAACGTT CTTTCCAAAC TTTGATAAAA TCTGGCGCGA 420

ATGGGCCCTT CTTCTGTTCT ATCCATTGTT OAAGAATGTC CACGTTGCGT CTTAAAATAA 480 

TATCAATATC ATGOGGATAA TTCATTTGAT TCATATGTTG CTCATATTCA TCTTCATCTA 540 

ATAAATGATA CTTTCCGTTT GGATATACTT TAATATCTAA ATCATAGTCT ATATATTTTA 600

ATGCCTCTTC ATCACAAACA AATGGTGATG ACAAATTGCA ATAGTAATAA ATTCCATCTT 660 

CTCTAAACAT GCAGATAACA TTaAACCAAT ATTCTGAGTG AAAGTaAACA ATTGCCGGTT 720 

CACGTCTTAT CCAAGTTCTT CCGTCACTTT CAGTCACTAA CGTATGATCA TTTCCACCAA 780

TGACAACATG ATCAGTACCC TTTAATATTG TTGTTTCAGA CCAAACGCGA TGAATCTTAC 840

CATCATGTTT ATAACTCTGA ATTTTAATGT TTTCCCCTTC TTTAGGTATG GATTCTCTGA 900

CCATACTCCA CACCACCTTC TGTTAATTTA ACCATTATAA ATTATAGCAT ATTTCAGAAA 960 

TAGTATTATA TAAATACATA TTTTTACGAA ATAAGATTTT ACTACTTAAT AATTAAACTC 1020 

GGTAATATTG CTAAGTACTA CAACAGAGAT TTACATGTCC CATTTAAAGT ATATAAAATC 1080

ATCACTTTTA TATATCAACA CTTTAACTTT TTGACATTGT TATTCTATGA GATTTAAAGA 1140 

TATCATTTAT ACTTTTTAAA ATTAATGTCA CTATGTTTTC CGATAATATT ACCAATCATC 1200 

GAATGTTACC CATTTATAAA TTGATAAATs TTTGACATAG GTACAGGGAA TGTATATTGA 1260

TCTCGATCAC TTAAATCAAA CCAAATCATG TCATCTGGTA ATGTTTCAAT GTTAATTGCT 1320 

CCTGAAACGG CGTATACTTT AATCTTCCAT GTTAAATGAG TAAATTGATG CTTCAACTCA 1380 

AAAATAGGTG TTTCTACTGG TTGAATGTCA TGACCGATTT TTTCAGTCAT TTTACGTCTA 1440

GCATGCTCAC TTTCAAACAT AGGAAATTGC CACATACCAT GCAATAATTT TTCGCTACGC 1500

TTTTGCAACA GATATTGACC TTGATTATTT CTAATTAAAA AGACGGATTG CTCAATTACT 1560 

TTTTTACTTA CATTTTTAGA TTTAACAGGT AACTTTTCAA ATGTACGTTT ATCAAATGCC 1620

TCACAGTTTT CTTGAACTGG ACAAAATAAG CATAATGGAT TTTTTGGTGT ACAAATTAAC 1680

GCCCCTAATT CCATCATAGC TTGATTAAAC GTTCCAGCTT CTGTAGTAAC ATACGGTAAT 1740 

AATTCTTGTT CGTACGATTT CCTCGTCGAT TGTAATTTAA TATCTCGATA GTCATCATTC 1800

AATCTAGACC ATACTCGAAA AACATTTCCG TCTACAGTTG CTAGTGGTAC ATTATATGCA 1860

ATGCTCATTA CTGCAGCTTG TGTGTATGGG CCAACACCTT TTAACGCTTT AAATTGATCA 1920

GGATCTTTGG GAACTAAGCC TTCATATTTA TCATGAACTT CTTTAATCGC CGTATGAAAA 1980
GCTTGaCTCA AAACTTCCAC AGTTGGAAAT CGTTCAACAA AACGATGATA ATAGTCAATA 2100

ACTGTTTTAA CTTGTGTCTG TTGTAACATG ACCTCACTTA ACCAAATATA GTACGGATTG 2160 

GTCGTTTGTC GCCATGGCAT TTCTCTTTGA TTTTCATCAA ACCAGTGTAT CAAATTTTCT 2220

TTAAAACTAG ACTGCTGATA GATTTATAAA ACCCTTTCCT CACCAAAATT AATTGTCTTT 2280 

ACTCATAATG TTTTTATTGT ACATTAAAAT CATGGTTAGT ATGTAAGTTA ATTTAGTTAT 2340 

TTGCGAAATT GGATTATAAT AGTATATATA ATATTATGAA ATCAGTGAAC TGATATGGAC 2400 

ACTGCAACAC ATATCGCAAT TGGGGTGGGC CTTACAGCAC TTGCAACTCA AGATCCAGCA 2460 

ATGGCTTCTA CGTTTGGTGC AACAGCTACA ACCCTTATCG TTGGTTCATT AATTCCTGAT 2520 

GGGGATACTG TTCTTAAATT AAAGGACAAT GCAACATATA TTTCGCATCA TAGAGGTATC 2580 

ACGCATTCCA TCCCTTTCAC AATACTATGG CCAATTTTAA TTACATTTTT AATATTCACG 2640 

TTCTTTAGTG GAACCAACCC ATTTCATGTA TGGATGTGGG CTCAGCTCGC AGTATTTTTA 2700

CATGTCTTTG TAGATATATT CAATTCTTAT GGTACACAAG CGCTTAGACC TATCACAAAC 2760 

AAATGGATTC AATTAAGTGT GATTAACACA TTTGACCCTA TTATTTTCAC AGTTCTTTGT 2820 

ATTGGTATTG TATTATGGGT TATAGGCTTG CATCCATTTG CAGTCTTCTT TCCTATAATC 2880

GCTTTACTAA TCATTTATTA CATGATTCGT TTTAAAATGA GAGCCGTAAT TAAGCAACAA 2940 

GCTTTAAAAG CAATTCAACA AGAGCATCAC CCTGTTAAAG TATTTGTTGC GCCAACAATA 3000 

AAATTTATGG AATGGCGTGT CGCGATACAA ACTGATGCAC ATGACTATGT TGGAAAAGCA 3060

TATGGTAGGA ATGTGGTGTT TAGTGATAAA GTGGAACGTC AAACATTATC AACAGACTCC 3120 

ATTTTATGGA AAGTCAAAGG TAATAAAGAT ATACGTACTT TTTTAAACTT TTCATCAATC 3180 

TATCGTTGGC AAACAACAAC GTTAGCAGAT GGTTCTACTG AAATTGGTTT GATTGATTTG 3240

CGTTATTTAA AAAATGATCA TTATTCATTT GTGGCAATTG CACATGTAAC AAACGATAAT 3300

GTCATAGACC ACTCTTATAT TGGCTGGGTA TTTACAGAAG ATAAGTTACA ACGTAAACTG 3360 

TATGCTAAAT AATTTCAAGT TATTATTCAC TAAAGTTAAT CTATAAAAAA TGAACAACCG 3420 

GGCAGAATGA AAATCAAAAC GATTTTTACT CTGTCCGGTT TTTTAATGTA AAACTATGAA 3480 

TGCTTTTACA AAATCTAAAA TTTATATTGT TGCTAAGAAA CTACCTTTAA TGACTCGAAA 3540 

TATCAAAATC AGTATAGGAA AACAATATCT AGATGATATT CTAATTGTTT CTGATTCTCA 3600

CAGATTAATT TACACAACAG GTCAGCTAAA CATCATGAAG AAGTATCCGC CTCGTCTGTA 3660

CTATCATTTG AAACATCCTG TTGATTATCa GTTTGTGTCA CTTTAGATGT TTTATTaTAA 3720

ATTGCGTGTG TCGTATACCT TGCTAATACG AAATTTACTG CAGCTATTAA ACATAAGAAT 3780 
TGTGCTATAC CATTAACAAT GTAATACATT GGATTTAGCA TTAGGATGTG ATTGATAAAT 3900

ACATGATTTG GATTTGGTAT GAAAATAATT GGTAACAATA AGAAACACAA TACACAAACC 3960 

CCATAAAATA TGATATTTAT TTTTTCAGwT AACAGTCGAA TAAGACCAAA AGTAACGGAT 4020 

ATTAATCCTA CAAAAATAGT TGCCATCACA ATAAAATAGA ATAGCGCTAT ATATGATGTT 4080 

TCGAAGTTTA CTGGTTTAAC CAATGCACTA ATCATCGTCA AAATGACTAG CATAATAAAA 4140 



CTTAAAATAG ACATAATAAC TACTGGCGTC G 4171 
(2) INFORMATION FOR SEQ ID NO: 470: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9821 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470: 

TGGTTGAAGT AGCAGTTAAT TCAAAATCTG CAACAGTTTC AGCAGAATaG GGGCTTTCAA 60 

AATAAtCAAA GGAGAATAAT TTATGACTAA AACTTTAAAG GTTTATAAAG GAGACGACGT 120

CGTAGCTTCT GAACAAGGTG AAGGCAAAGT GTCAGTAACT TTATCTAATT TAGAAGCGGA 180

TACAACTTAT CCAAAAGGTA CTTACCAAGT GGCATGGGAA GAAAATGGTA AAGAATCTAG 240

TAAAGTTGAT GTACCTCAAT TCAAAACCAA TCCAATTCTA GTCTCAGGCG TATCATTTAC 300

ACCAGAAACT AAATCAATTA TGGTAAATAC CGATGACAAT GTTGAGCCAA ACATTGCACC 360 

AAGCACAGCA ACGAATAAAA TATTGAAATA TACAAGTGAA CATCCAGAAT TTGTTACTGT 420 

AGATGAAAAT ACAGGAGCAA TTCACGGTGT AGCTGAAGGT ACTTCAGTAA TCACTGCTAC 480

GTCTACTGAT GGAAGCGATA AGTCAGGACA AATTTCAGTG ACAGTAACAA ACGGATAGGG 540 

ATTTAAGGCG CAGTATATCT GCGTCTTTTT TATTTGAATA AAAGGAGCTA ATACAATGAT 600 

TAAATTTGAA ATTAAAGATC GTAAAACAGG AAAAACAGAG AGCTATACAA AAGAAGATGT 660 

AACAATGGGC GAACAGAAAA ATGCTATGAG TATTTAGAAT TAGTAAATCA AGAGAATAAA 720 

AAAGAAGCAC CTAACGCAAC AAAAATGAGA CAAAAAGAGC GACAGTTATT AGTAGATTTA 780 

TTTAAAGATG AAGGATTGAC TGAAGAAGAT GTTCTGAACA AGATGAGTAC TAAAACTTAT 840 

ACAAAAGCCT TACAAGATAT ATTTCGAGAA ATCAATGGTG AAGATGAAGA AGATTCAGAA 900 

ACTGAACCAG AAGAGATGGG AAAGACAGAA GAACAATCTC AATAAAAGAC ATTTTATCGA 960

ACATTAAGAA AATACAACGT TTCTGTATGG AGCAGTATGG GTGGACATTA ACTGAAGTCA 1020 
AAGAAAAACA AAGTGAACAA AAAGTCATTA CAGGTACGGA TTTAAGAAAA CTTTTTGGAA 1140

GCTAGAAAGG AGGTTAATAT GAATGAAAAA GTAGAAGGCA TGACCTTGGA GCTGAAATTA 1200

GACCATTTAG GTGTCCAAGA AGGCATGAAA GGTTTAAAGC GACAATTAGG TGTTGTTAAT 1260

AGTGAAATGA AAGCTAATCT GTCAGCATTT GATAAGTCTG AAAAATCAAT GGAAAAATAT 1320

CAGGCGAOAA TTAAGGGGTT AAATGATAGG CTTAAAGTTC AAAAAAAGAT GTATTCTCAA 1380

GTAGAAGATG AGCTTAAACA AGTTAACGCT AATTACCAAA AAGCTAAATC CAGTGTAAAA 1440

GATGTTGAGA AAGCATATTT AAAGTTAGTA GAAGCCAATA AAAAAGAAAA ATTAGCTCTT 1500

GATAAATCTA AAGAAGCCTT AAAATCATCG AATACAGAAC TTAAAAAAGC TGAAAATCAA 1560

TATAAAOGTA CAAATCAACG TAAACAAGAT GCGTATCAAA AACTTAAACA GTTGAGAGAT 1620

GCAGAACAAA AGCTTaAGAA TAGTAACCAA GCTACTACTG CACAACTAAA AAGAGCAAGT 1680

GACGCrtACA GAAGCAGTCC GCTAAGCATA AAGCACTTGT TGAACAATAT AAACAAGAAG 1740

GCAATCAAGT TCAAAAACTA AAAGTGCAAA ATGACAATCT TTCAAAATCA AATGATAAAA 1800

TTGAAAGTTC TTACGCTAAA ACTAATACTA AATTAAAGCA AACAGAAAAA GAATTTAATG 1860

ATTTAAACAA TACTATTAAG AATCATAGCG CTAATGTCGC AAAAGCTGAA ACAGCTGTTA 1920

ATAAAGAAAA AGCTGCTTTA AATAATTTGG AGCGTTCAAT AGATAAAGCT TCATCCGAAA 1980

TGAAGACTTT TAACAAAGAA CAAATGATAG CTCAAAGTCA TTTCGGTAAA CTTGCAAGTC 2040

AAGCGGATGT CATGTCAAAG AAATTTAGTT CTATTGGAGA CAAAATGACT TCCCTGGGAC 2100

GTACAATGAC GATGGGCGTA TCTACACCGA TTACTTTAGG TTTAGGTGCA GCATTAAAAA 2160

CGAGTGCAGA CTTTGAAGGG CAAATGTCTC GAGTTGGAGC GATTGCACAA GCAAGCAGTA 2220

AAGACTTAAA AAGCATGTCT AATCAAGCGG TTGACTTAGG AGCTAAAACA AGTAAAAGTG 2280

CTAACGaAGT TGCTAAAGGT ATGGAAGAAT TGGCAGCTTT AGGCTTTAAT GCCAAACAAA 2340

CAATGGAGGC TATGCCAGGT GTTATCAGCG CAgcAGaAGC AAGTGGTGGA GAAATGGCTA 2400

CAACTGCAAC TGTAATGGCT TCAGCGATTA ACTCTTTCGG TTTAAAAGCA TCTGATGCAA 2460

ATCATGTTGC TGATTTACTT GCGAGATCAG CAAATGATAG TGCTGCAGAT ATTCAATATA 2520

TGGGAGATGC ATTAAAATAT GCAGGTACTC CAGCAAAAGC ATTAGGAGTT TCAATAGAGG 2580

ACACTTCTGC AGCAATTGAA GTTTTATCTA ACTCAGGTTT AGAGGGGTCT CAAGCAGGTA 2640

CTGCATTAAG AGCTTCGTTT ATTAGGCTAG CTAATCCAAG TAAAAGTACA GCTAAGGAAA 2700

TGAAAAAATT AGGTATTCAT TTGTCTGATG CTAAAGGTGA GTTTGTTGGA ATGGGCGAAT 2760

TGATTAGACA GTTCCAAGAT AACATGAAAG GCATGACGAG AGAACAAAAA TTAGCAACAG 2820
CAGATAAAAT TAATAGCTAT AGCAAATCAT TGAAGAACTC TAATGGTGAA AGTAAAAAAG 2940

CAGCTGATTT GATGAAAGAT AACCTCAAAG GTGCTCTGGA ACAATTAGGT GGCGCTTTTG 3000

AATCGTTAGC AATTGAAGTT GGTAAAGATT TAACGCCTAT GATTAGAGCA GGTGCGGAAG 3060

GATTAACAAA ATTAGTTGAT GGATTTACAC ATCTTCCTGG TTGGGTTAGA AAGGCTTCGG 3120

TAGGCTTAGC AATTTTTGGT GCATCTATTG GTCCTGCTGT TCTTGCTGGT GGCTTATTAA 3180

TACGTGCAGT TGGGAGCGCG GCTAAAGGCT ATGCATCATT AAATAGACGC ATTGCTGAAA 3240

ATACAATTCT TTCTAATACC AATTCAAAAG CAATGAAATC TTTAGGTCTT CAAACATTAT 3300

TTCTTGGTTC TACAACAGGA AAAACGTCAA AAGGCTTTAA AGGATTAGCC GGAGCTATGT 3360

TGTTTAATTT AAAACCTATA AATGTTTTGA AAAATTCTGC AAAGCTAGCA ATTTTACCGT 3420

TCAAACTTTT GAAAAACGGT TTAGGATTAG CCGCAAAATC CTTATTTGCA GTAAGTGGAG 3480

GCGCAAGATT TGCTGGTGTA GCCTTAAAGT TTTTAACAGG ACCTATAGGT GCTACAATAA 3540

CTGCTATTAC AATTGCATAT AAAGTTTTTA AAACCGCATA TGATCGTGTG GAATGGTTCA 3600

GAAACGGTAT TAACGGTTTA GGAGAAACTA TAAAGTTTTT TGGTGGCAAA ATTATTGGCG 3660

GTGCTGTTAG GAAGCTAGGA GAGTTTAAAA ATTATCTTGG AAGTATAGGC AAAAGCTTCA 3720

AAGAAAAGTT TTCAAAGGAT ATGAAAGATG GTTATAAATC TTTGAGTGAC GATGACCTTC 3780

TGAAAGTAGG AGTCAACAAG TTTAAAGGAT TTATGCAAAC CATGGGCACA GCTTCTAAAA 3840

AAGCATCTGA TACTGTAAAA GTGTTGGGGA AAGGTGTTTC AAAAGAAACA GAAAAAGCTT 3900

TAGAAAAATA CGTACACTAT TCTGAAGAGA ACAACAGAAT CATGGAAAAA GTACGTTTAA 3960

ACTCGGGTCA AATAACAGAA GACAAAGCAA AAAAACTTTT GAAAATTGAA GCGGATTTAT 4020

CTAATAACCT TATAGCTGAA ATAGAAAAAA GAAATAAAAA GGAACTCGAA AAAACTCAAG 4080

AACTTATTGA TAAGTATAGT GCATTCGATG AACAAGAAAA GCAAAACATT TTAACTAGAA 4140

CTAAAGAAAA AAATGACTTG CGAATTAAAA AAGAGCAAGA ACTCAATCAG AAAATCAAAG 4200

AATTGAAAGA AAAAGCTTTA AGTGATGGTC AGATTTCAGA AAATGAAAGA AAAGAAATTG 4260

AAAAGCTTGA AAATCAAAGA CGTGACATCA CTGTTAAAGA ATTGAGTAAG ACTGAAAAAG 4320

AGCAAGAGCG TATTTTAGTA AGAATGCAAA GAAACAGAAA TGCTTATTCA ATAGACGAAG 4380

CGAGCAAAGC AATTAAAGAA GCAGAAAAAG CAAGAAAAGC AAGAAAAAAA GAAGTGGATA 4440

AGCAGTATGA AGATGATGTC ATTGCTATAA AAAATAACGT CAACCTTTCT AAGTCTGAAA 4500

AAGATAAATT GTTAGCTATT GCTGATCAAA GACATAAGGA TGAAGTAAGA AAGGCAAAAT 4560

CTAAAAAAGA TGCTGTAGTA GACGTTGTTA AAAAGCAAAA TAAAGATATT GATAAAGAAA 4620
GTTGGTGGTC TAACTTTAGA GAAGACCAAA AGAAGAAAAG TGATAAATAC GCTAAAGAAC 4740

AAGAAGAAAC AGCTCGTAGA AACAGAGAAA ATATAAAGAA ATGGTTTGGA AATGCTTGGG 4800

ACGGCGTAAA AACTAAAACT GGTGAAGCCT TTAGTAAAAT GGGCAGAAAT GCTAATCATT 4860

TTGGGGGCGA AATGAAAAAA ATGTGGAGTG GAATCAAAGG AATTCCAAGC AAATTAAGTT 4920

CAAGTTGGAO CTCAGCCAAA AGTTCTGTAG GATATCACAC TAAGGCTATA GCTAATAGTA 4980

CTGGTAAATG GTTTGGAAAA GCTTGGCAAT CTGTTAAATC GACTACAGGA AGTATTTACA 5040

ATCAAACTAA GGAAAAGTAT TCAGATOCCT CAGATAAAGC TTGGGCGCAT TCAAAATCTA 5100

TTTGQAGAGG CACATCAAAA TGGTTTAGCA ACGCATATAA AAGTGGAAAG GGTTGGCTAA 5160

TAGATATGGC TAATAAATCG CGCTCGAAAT GGGATAATAT TTCTAGTACA GCATGGTCGA 5220

ATGCAAAATC CGTTTGGAAA GGAACATCGA AATGGTTTAG TAACTCATAC AAATCTTTAA 5280

AAGGTTGGAC TGGGGATATG TATTCAAGAG CCCACGATCG TTTTGATGCA ATTTCAAGTT 5340

CGGCATGGTC TAACGCTAAA TCAGTATTTA ATGGTTTTAG AAAATGGCTA TCAAAAACAT 5400

ATGATTGGAT TAGAGATATT GGTAAAGACA TGGGAAGAGC TGCGGCTGAT TTAGGTAAAA 5460

ATGTTGCTAA TAAAGCTATT GGCGGTTTGA ATAGCATGAT TGGCGGTATT AATAAAATAT 5520

CTAAAGCCAT TACTGATAAA AATCTCATCA AGCCAATACC TACATTGTCT ACTGGTACTT 5580

TAGCAGGAAA GGGTGTAGCT ACCGATAATT CAGGAGCATT AACGCAACCG ACATTTGCTG 5640

TATTAAATGA TAGAGGTTCT GGAAACGCCC CAGGTGOTGG AGTTCAAGAA ATAATTCACA 5700

GGGCTGACGG AACATTCCAT GCACCCCAAG GACGAGATGT GGTTGTTCCA CTAGGAGTTG 5760

GAGATAGTGT AATAAATGCC AATGACACTC TGAAGTTACA GCGGATGGGT GTTTTGCCAA 5820

AATTCCATGG TGGTACGAAA AAGAAAAAAT GGATGGAACA AGTTACTGAA AATCTTGGTA 5880

AAAAAGCAGG GGACTTCGGT TCTAAAGCTA AAAACACAGC TCATAATATC AAAAAAGGTG 5940

CAGAAGAAAT GGTTGAAGCG GCAGGCGATA AAATCAAAGA TGGTGCATCT TGGTTAGGCG 6000

ATAAAATCGG CGATGTGTGG GATTATGTAC AACATCCAGG GAAACTAGTA AATAAAGTAA 6060

TGTCAGGTTT AAATATTAAT TTTGGAGGCG GACTAACGCT ACAGTAAAAA TTGCTAAAGG 6120

CGCGTACTCA TTGCTCAAAA AGAAATTAGT AGACAAAGTA AAATCGTGGT TTGAAGATTT 6180

TGGTGGCGGA GGCGATGGAA GCTATCTATT TGACCATCCA ATTTGGCAAA GGTTTGGGAG 6240

TTACACAGGT GGACTTAACT TTAATGGCGG TCGTCACTAT GGTATCGACT TTGGTATGCC 6300

TACAGGAACG AACATTTATG CTGTTAAAGG CGGTATAGCT GATAAAGTAT GGACTGATTA 6360

CGGTGGCGGT AATTCTATAC AAATTAAGAC CGGTGCTAAC GAATGGAACT GGTATATGCA 6420
ATCAGGTGCT ACAGGTAATT TCGTTAGAGG AGCACACTTA CATTTCCAAT TGATGCAAGG 6540

GTCGCATCCA GGGAATGATA CAGCTAAAGA TCCAGAAAAA TGGTTGAAOT CACTTAAAGG 6600

TAGTGGCGTT CGAAGTGGTT CAGGTGTTAA TAAGGCTGCA TCTGCTTGGG CAGGCGATAT 6660

ACGTCGTGCA GCAAAACGAA TGGGTGTTAA TGTTACTTCG GCTGACGTAG OAAATATCAT 6720

TAGCTTGATT CAACACGAAT CAGGAGGAAA TGCAGGTATA ACTCAATCTA GTTCGCTTAG 6780

AGACATCAAC GTTTTACAGG GCAATCCAGC AAAAGGATTG CTTCAATATA TCCCACAAAC 6840

ATTTAGACAT TATGCTGTTA GAGGTCACAA CAATATATAT AGTGGTTACG ATCAGTTATT 6900 

AGCGTTCTTT AACAACAGAT ATTGGCGCTC ACAGTTTAAC CCAAGAGGTG GTTGGTCTCC 6960 

AAGTGGTCCA AGAAGATATG CGAATGGTGG TTTGATTACA AAGCATCAAC TTGCTGAAGT 7020 

GGGTGAAGGA GATAAACAGG AGATGGTTAT CCCTTTAACT AGACGTAAAC GAGCAATTCA 7080 

ATTAACTGAA CAGGTTATGC GCATCATCGG TATGGATGGC AAGCCAAATA ACATCACTGT 7140

AAATAATGAT ACTTCTACAG TTGAAAAATT GTTGAAACAA ATTGTTATGT TAAGTGATAA 7200 

AGGAAATAAA TTAACAGATG CGTTGATTCA AACTGTTTCT TCTCAGGATA ATAACTTAGG 7260 

TTCTAATGAT GCAATTAGAG GTTTAGAAAA AATATTGTCA AAACAAAGTG GGCATAGAGC 7320

AAATGCAAAT AATTATATGG GAGGTTTGAC TAATTAATGC AATCTTTTGT AAAAATCATA 7380

GATGGTTACA AGGAAGAAGT AATAACAGAT TTTAATCAGC TTATATTTTT AGATGCAAGG 7440

GCTGAAAGTC CAAACACCAA TGATAACAGT GTAACTATTA ACGGAGTAGA TGGTATTTTA 7500 

CCGGGCGCAA TTAGTTTTGC GCCTTTTTCA TTAGTATTAA GGTTTGGCTA TGATGGTATA 7560 

GATGTTATAG ATTTAAATTT ATTTGAGCAT TGGTTTAGAT CTGTGTTTAA TCGCAGACAT 7620 

CCTTATTATG TTATTACTTC TCAAATGCCT GGTGTTAAAT ATGCAGTGAA TACAGCTAAT 7680 

GTTACATCTA ATTTAAAAGA TGGTTCTTCA ACTGAAATTG AAGTAAGTTT AAATGTTTAT 7740 

AAAGGGTATT CTGAATCAGT TAATTGGACC GATAGCGAGT TCTTATTCGA CTCTAATTGG 7800

ATGTTTGAAA ATGGAATTCC TCTTGATTTC ACACCTAAAT ATACTCATAC ATCAAATCAA 7860 

TTTACTATTT GGAACGGTTC TACTGATACG ATAAATCCAC GATTCAAGCA CGATTTGAAA 7920 

ATATTAATTA ATTTAAATGC GAGTGGAGGA TTTGAACTGG TTAATTATAC AACAGGTGAT 7980

ATTTTTAAGT ACAACAAAAG TATAGATAAA AACACTGATT TTGTTTTAGA TGGTGTGTAT 8040 

GCATATCGAG ATATAAACAG AGTGGGAATT GATACAAATA GAGGCATTAT AACATTAGCG 8100 

CCAGGTAAAA ATGAATTTAA GATTaAAGGA GACGTCAGTG ATATTAAAAC TACATTTAAG 8160 

TTTCCTTTTA TTTATAGGTA GGTGATTTAA TGGATTATCA TGATCATTTA TCAGTAATGG 8220 
ATTATGAACT GAATGAAGCT AGGTACATCA CCTTTACAGT TTATAGAACT ACTCATAATA 8340

GTTTTGTTTT TGATTTATTG ATTTGTGAAA ACTTCATAAT TTATCATGOT GAAAAATACA 8400 

CAATTAAGCA GACAGCGCCA AAGGTTGAAG GTGATAAAGT TTTTATTGAA GTTACGGCAT 8460

ATCACATAAT GTATGAATTT CAAAATCACT CAGTGGAATC AAATAAGCTT GATGAOGACA 8520 

GTAGCGAAAC TGGTAAAACG CCAGAATACT CTTTAGATGA GTACTTAAGA TATGGATTTG 8580 

CAAATCAAAA AACGTCAGTC AAGATGACCT ATAAAATAAT TGGAGATTTT AAAAGAAAAA 8640 

TACCAATTGA TGAATTAGGT AATAAAAATG GCTTAGAATA TTGTAAAGAA GCAGTAGATT 8700 

TGTTTGGTTG TATTATTTAT CCAAATGATA CGGAGATATG TTTTTATTCT CCTGAAACAT 8760

TCTATCAAAG AAGCGAAAAA GTAATAAGGT ATCAATATAA TACTGATACT GTGTCTGCTA 8820

CTGTCAGTAC GTTGGAATTA AGAACAGCTA TAAAAGTTTT TGGGAAAAAG TACACAGCCG 8880

AGGAAAAGAA AAATTATAAT CCTATTAGAA CAACTGACAT TAAATATTCA AATGGTTTTA 8940

TAAAAGAAGG TACTTATCGT ACCGCAACAA TTGGGTCTAA AGCTACTATT AACTTTGATT 9000 

GCAAGTATGG TAATGAAACA GTTAGATTTA CAATAAAaAA GGGCTCTCaA GGTGGAATAT 9060 

ATAAGTTGAT TTTAGACGGC AAGCaAATTA AGCaAATTTC TTGTTTTGCT AAGTCGGTTC 9120

AGTCTGAmAC AATAGATTTA ATAaAAAATA TTGATAAAGG CAAGCACGTT TTAGAAATGA 9180 

TATTTTTrGG AGArGrCCCC AAAAATAGAA TTGATATATC TTCAAATAAA AAAGCTAAGC 9240 

CTTGTATGTA TGTTGGAACT GAAAAATCAA CAGTCTTAAA TTTAATTGCT GACAACTCAG 9300

GTCGCAATCA ATACAAAGCA ATTGTTGaCT ACGTCGCAGA TAGTGCAAAG CAGTTTGGGA 9360

TTCGATATGC TAATACGCAA ACAAATGAAG ATATCGAAAC ACAGGATAAG CTGTTAGAAT 9420 

TTGCAAAAAA GCAAATAAAT GATACTCCTA AGACTGAATT AGATGTTAAT TATATAGGTT 9480 

ATGAAAAAAT AGAGCCAAGA GATAGCGTAT TCTTTGTTCA TGAATTAATG GGATATAACA 9540 

CTGAATTAAA GGTTGTTAAA CTTGATAGGT CACATCCATT TGTAAACGCA ATAGATGAAG 9600

TGTCTTTCAG CAATGAAATA AAGGATATGG TACAAATTCA ACAAGCGCTT AACAGACGAG 9660 

TTATTGCACA AGATAATAGA TATAACTATC AAGCAAATCG TATAAATGAT TTATACACTA 9720 

GTACTTTGAA TTCTCCTTTC GAGACAATGG ATATAGGGAG TGTATTAATA TAATGGCAAC 9780

AGAAGAAGTT AAAATCAAAG CGCTACTTGA AAACGATAAA C 9821 
(2) INFORMATION FOR SEQ ID NO: 471: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1017 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471:



[positions 1-180 largely illegible in source; legible fragment: CCAAATGGTC]


TATGGTATTA 


CCTTTTCATC 


CAAAAAXAAA 


ATGCCGTTTC 




ATGGTGTTTT 


240 


TGAAATGGAT 


ATCTGCAGTG 


CTTATATCGA 


TAGACTGGCG 


CTATAACAAG 


^^*H1>^ TV TV TV TV ^1 TV 

CGTGAAAACa 


300 


GCAACAAGTA ATATTATAAT CACACTACTA ACAATGGATA AATGGTTATC TAAAATCTCC 360

CAAAATAGCC AGTTCAAAAT ATTAAATGOG ATTAAAAGTA CAAGCGCTAT GGGTATCCAA 420

AACAGCACAG CACTTAACCT CATCACTTTT TTAGCGTGTG GTGACATAAA ATTATAATCC 480

CTCATCATTT TCACCTCTTA AATACCATGA TTTCAATTTA TTTGCATCTT CACTTCTTGC 540

GTATTTTAAG TTAATCGATT GGGCTCCCAC ACCTTTAGCA ACAATAAAGC TAAAATTATT 600

TAAATTGTTG CGTTTAAGTA ATGTATGTTG CCAAGTGTCA AATCCTATAA TGTGATGCGC 660

TTTAAAATAA TAAATATTTC GTTTCAATAG CTCGAAATTC TGGATAGTAA TTTGTTCTTC 720

TGTCATTTTA AAACCCGCAT GTTTGACATA AAGATATCCT TTGATCACAA ATAAACCAAT 780

AATGACTATT GTTATAATCG TAAATAACAA TAATAATTGA TTCCAAAAAT AACAGCCTAT 840

ACCTGCCATA GCTATGACAA TAATACTAGG TATTAAAAAG TGTCTGTGGA AACCTGACAA 900

AGGCATACCT TCATTAACTT GTTGATAAGA TAAATCTGGT ACTAAATTCT GGATAATTTG 960

ATATGCTTTG TCTCGTTTAA TAAACGGCaA TATCGGCACA CTACCTGAAT CATTGTC 1017



(2) INFORMATION FOR SEQ ID NO: 472:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6806 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472:

TAGGATTGAT TAATCCTTTC TTCAAAATGA TGAATGTGTA ATGTTAAATA TATAATTTCA 60 
GACTCACTGA TATTAACATC AAATTGTTTT TGTATCATAG TTAAAATTTT ATATGCTGTG 120 
TTATAGCAAA TCGGATAGTG ATTTTTAATC ATAGACACAA AATCATCTTG TGCATGTATA 180
TATTCTTTTC TTCTTAATCG GCGAATTAAA AATTGTACGT GCCTTATAAA ACGTTGGTAT 240
TTAATAACAT TATTnGATCA AGGTCATCTC
AATATGTAAT GCAATAAATC CTATCTCATC
TAACTGATCA ATCACCTGTT TAGCAATATG
CATAACAAAT GGATTGCTAA TAACTTGATT
ATGATCCGTT AATGAAACTA CAAGTTGTTT
AAAATTCAAC GAATCAATAA TTACTTGTAA
TTTATAATGT GCTTTTTGTT GCTCACTTTC
TTGGTCGTTT AACGCCATTC CCTCTTTTTT
AACTTCTTGA TCATTATTAG TACATACTAC
ATATTCTCCC ATTATCATCA CCTATTTTTC
AACTTGAAAG TATAATAATT TAAATACTTG
TATCTTAATT TAATTTCGAT GTCTCTCAAA
CCCATATTTA CAAATAATTC TATTTATTTA
CTAAAACGTA CTATTAAATT GTGCAAAGCT
TATGTCTTAT CAATGGTATA GTCTTTGCAC
TATATTAATA TATAATCAAA TCTTAGATTA
TACAAATTTA AAACCCTGAG ATTTTCGCTT
TTTATATAAT GAATCGTTAC ATTAAAAATA
CGCCCCAAGC ACTAATTCCA AATAAGTTAA
TCTTGCCTTC TTTTTTCTGC TTTTGATAAT
GTCCTATAAT GGAAATAATG TTTACAATTG
TCCATACTGT TTCTGTTTTT ACAACTGCAC
TAAAGATAAA TAATATTACA CGGTTTGTTG
AGTAATTACC TACAACAGAT GAAAATGCAA
CTCCAGCAGA ACCTAAATGC TCATTAAGTG
CGTTATCACC AAATTTCAGT CCTGAATATA
CCAACATTGT ATCAAAGAAC ACACCTAATG
GTACGGCAGC AGTTGCCGCT GCATTCGGCG
AACCACGTTT GATACCTTGA AGAACCGCAG



ATGCAXAGAT 


AAATCTTCTG 


T ATTaGATG C 

A A A A V* W 




TTCAGGaAAA 


TGT a CATGTA 


ATGCTGCATT 


^* « w 


ATATGGATCA 


CTATATAACT 


GCATAGTTTC 

%J A «»Vrf AAA \r 


t O \J 


TTGTTTTAAO 

• a>»a A A A rllTW 


CGTTTATAAG 


CAAATATAAT 




TGAATCAACA 


TTCATCGCAG 


TATTAGAAAT 

X^IX X AunnnX 






TPtfffVi 1TTT 
X LnuLUH* X X 


mArTAa&rT 

WtnL l/vihV» X 




T* A llTTT A*F A A 


ft T i-i-i-ii'' , i -f*^*T\ 
t\ X 1L1H 


X luliinlnu 1 


^ o rt 
720 


{3TTAAAAfv*A 


AlnLLl X X-roL. 


UjAX xaaaax 


780 




X lUUilul X X 


lAu 1 AAUUiT 


840 


X AX 1 AX lAAu 


a *T*1* A *PA f*a TV* 

AX IAiAxAxC 


V5G AAAl\j xui 


900 


TG TT CT ATT A 


CACGCCTATC 


CTATATGATA 


960 


vTGGAATaAC 


TATAATAAAA 


AAATCTGACT 


1020 


TATATTATCT 


GAATTAATAC 


TCAATACAAA 


1080 


AAAACAAATT 


TATATTCATC 


TATCCAACAA 


1140 


ACCAATGGAG 


GAAATAAATC 


TCAACCTTAC 


1200 


AwTAGTGTAA 


TGATAGAGAT 


GATAATTGAG 


1260 


TAATTTGAAA 


ACCTCZAGGGT 


TTATTTGATT 


1320 


TTTATTTATC 


AGAGlTCliA 


TATTTGTTAG 


1380 


TTTCTAAGTT 


TTCAGGTTTA 


AAGACAGGGT 


1440 


\- X X 1 1. 




nuil X\avsACA 


1500 




f*ATA & *T& Af2 
V-rti AAAX AA\9 


TPiwrnT TV HP 
1 v» XtjV-CVj 1 Ax 


1560 








1620 


ATAAAAATTC 




ff3Ar*l*V5T"li AT* 


icon 


ACAGTOTAAC 


*»o»- mji x w x x 


AAfSAA AATAf 




CTGATTGAGT 


AACTGCAACA 


CCTTGAGGTG 


1600 


GTAAAATCAT 


GATTGCAGTT 


GCTGTACAAA 


1860 


ATTGGATTAA 


ACCTTGCTTA 


ACAGGGTGTG 


1920 


CAGAACCCAT 


ACCAGCTTCG 


TTAGAGAATA 


1980 


CACCTACAGC 


GCCACCAGTT 


ACTTGTTCGA 


2040

GCAATATTAC TAAAACCATA CCAATGTAAA TGATAGCCAT AATCGGTACA ATTAACGAAQ 2160

ATAACGTAGC AATACTACGT ACACCACCAA ATATAATAAT AGCTGTTACG ATTGCTAAAA 2220

TAATACCTGT GATTACTGGA CTAATATTAT ATTGCOTATT TAACGACTCC GCAATTGTAT 2280

TAGATTGCAC TGTGTTAAAT ACAAATGCAA ATGTAATTGT AATTAAAATC GCAAATACGA 2340

TACCTAGCCA TTTTTGATTT AAACCTTTAG TAATATAGTA AGCTGGACCA CCACGGAATC 2400

CACCATCTTT ATCATGTACT TTATAAACCT GAGCCAAAGT CGCTTCTATA AATGCACTCG 2460

CTGCACCTAT AAATGCAATA ACCCACATCC AAAATACTGC ACCTGGACCG CCTAAAACAA 2520 

TCGCAGTCGC AACACCAGCA ATATTACCAG TACCAACTCT CGAACCAGCA CTAATCGCAA 2580 

ATGCTTGGAA TGGCGAAATA CCCTTCTTAC CATCTTCTAA AGTTTCTGGA CGTTCTACTA 2640 

AAGCTCTAAA CATTTCAGGT AACATTCGTA ATTGAACGAA TTTAGAACTA ATCGTAAAGA 2700 

AGAATCCAGC TGTCAATAAT AGACCAATTA AATATTGAGA CCATATTAAA TCGGTACCAA 2760 

CATGGACAAA TTCTTTAAAC CATCCAGGTA TTAAACTATC GAAATCTTTC AAAATAAACC 2820

CCTCGCATCC TCTACATGAA TCATGTACCT TCTATAAAAT TAGACCGAAT TGAACCTTCA 2880

GTAAATATAG AGATACATCA TCATTTCTTA TACAATACAA GAGATTTATA TTAGTTTGGT 2940

CAAAGTATAT CGCTAATTTA ACGATAAGTA CTTGGTCAGC ATTTAATATA AATCCCTTGA 3000

ATTTAGTCAA AATTTAACAT TACTGTATTT TATCATTTAA TTTCGTGATT GCATATAGTT 3060

TTTAGCTAAT ATACATGTCT ATTACTTCAC CAAAATCATC TGTATCTACA ATGAATGAGC 3120

CATTTGTATA TTGTTCAGAT TTATGAATAT CATTAATTAA ACCATGTTCT TCATTTGATT 3180

TTGAATATAA TGTATATTGA CTATGTTCAC CTGTCACTAC ATGTGCAGCT ACAATACGAT 3240

GTGGATTTTT CTTTAATTCT TTTAATAAAG TTATTCCaCG TTGTGCTCTT TTAGCAACTT 3300

GTAAGATTTT AAAACTAATA CGTTTTAACG AGCCGCGTTG TGTGGCCATC AATATAGTAT 3360

CATTTTCAGA AACACCTTCT GTCATAACAA CGAAATCTTC AGCTTTAAGA TTTATTGATT 3420

TAACACCAGC TGCCCTTAAT CCGGTATCTG ATAGTTCACT TGTATTATAC GTTAATGACA 3480

TACCTTTATT AGTAATGACG GTAATTAATT GATCTTTTTC AAAGCGCATA ACACTAATCA 3540

AATCATCATT TTCTTTAACT TTAGTAGCAA TTAAAGGTTT ATTAAAACGC GTTGTTTTAA 3600

ATAGAGGCAC TGTACTTTTC TTAATCATGC CATTTTGAGT CGCAAAAACA TAAAATGCAT 3660

CTGTATTAAA GTCCTTTTCA TTAAAGACAT TAATAACCAC TTCATCTTCT TCGATAGGAA 3720

CTATTTGTGA TACATGTTGT CCCAATTCTT TCCAACGAAT ATCTGCTAAT TTATGAACCG 3780

GTATAAATAG ATAACGACCT TTATTTGTAA ATACTAGTAC GGTATCTTGC GTATTTACTT 3840

TAAAGCTACG AATAGAAGTA CGTTTAATAT ATCCATGACG TGTCATACTT AAAATAACTy 3960

CTTCACTAGG CACCATAACT TCTTTGTCAA TTTTAATTTC TTCAATTTCT GCTTCAATTA 4020 

AAGACAGTCG TTCAGATTTG AATTTCTTTT TAATTTCATT CAATTCTTCT TTTATGACAT 4080 

TCAATAATOC ATCATGGTTA TCAAGAATAT GACGTAATTG TTTGATTAAT GCTTCAAGTT 4140 

CTTTATQTTC ACCTTCAAGC GCAACTATGT CAGTATTTGT TAAACGATAT AACTGTAACA 4200 

TTACAATTGC TTCAGCCTGT TCTTCTGTGA ACTCGTATAC TTCGATAAGG TTTTCTTTAG 4260 

CGTCACGCTT GTTTTTAGAG CwACGAATCA ATTCGATTAC TTTATCTAAA ATTGACAACG 4320 

CTTTAATCAA ACCTTCAACG ATATGCATAC GTTTTTCTGC ATTATCTAAT TCAAACTTCG 4380

TTCTATTTGC AACAACCTCA ATTTGGTGAT TCAAATAACT ATCTATAATT TGACGAATAC 4440

CCATCAATTT TGGACGACCA TCACTAATAG CGACCATGTT GAAATTATAT GAAATCTGTA 4500 

AATCAGAGTT TTTATAAAGA TAATTTTTGA TTGATTCACT GTTCACATCT TTTTTCAATT 4560 

CAATTGCTAT TCGTAAACCA GTTCTATCAG TTTCATCACG TACTTCAACG ATACCATCGA 4620 

CTTTTTTGTC AGCACGTAAT TCATCGATAC GTTTTACTAA GCTACTTTTG TTCACTTCAT 4680 

ATGGAATTTC AGTAATAATT AACTGTTTAC GTCCATTGCG TAAAGTTTCT TCTTCAACTT 4740

TAGAACGAAC TATAATTCTA CCTTTACCTG ATTCATAAGC TTTTTTAATA CCATCAATAC 4800

CTTGAATAAT ACCACCAGTT GGAAAATCAG GACCTTTAAT ATATTTCATT AATTGATTGA 4860

CTGTAATATC CGGATTATCA ATATATTTAA GTGTTGCTTG AATCACTTCA GCTAAATTAT 4920

GTGGTGGTAT ATCTGTCGCG TAACCTGCAG ATATACCTGT AGAACCATTC ACTAGTAAGT 4980 

TAGGAAATCT TGATGGCAAT ACCATTGGTT CGAGTGTCGT ATCATCATAG TTTGGAATGA 5040 

AAGAAACTGT CTCTTTATTA ATATCACGTA ATAACTCTTC AGCTAGTAAG CTTAACTTAG 5100

CTTCAGTGTA ACGCATTGCC GCTGGCGGAT CATTATOGAT ACTACCATTA TTACCATGCA 5160 

TTTCTATTAA GACATGTCGT AACTTCCAGT CTTGACTTAA ACGGACCATT GCTTCGTACA 5220 

CTGAGGAGTC TCCATGTGGA TGATATTGAC CAATAACATC ACCGACTGTT TTCGCACTTT 5280 

TACGGAAATT TTTATCGTGT GTATTACCAC TTGAATACAT TGCATATAAA ATACGACGTT 5340 

GTACTGGTTT TAAACCATCA CGAACATCTG GCAATGCACG CTCTTGAATA ATATATTTAC 5400 

TATATCTTCC AAAGCGATCA CCTAAAACAT CTTCAAGTGA TAAATCTTGA ATTATTTCAC 5460 

TCACTAGATT TCCTCCTCAT CAAATTGATC ATTTTCAAGC ACTTGTACTT CAGAATTATC 5520 

TAAAATACTT TGGTCCTCTT GCATACCAAA CTCAACATGC TTTTCAATCC ATTCACGTCT 5580

AGGTTGTACT TTGTCACCCA TTAATGTTGT TACACGTTTA GATGAACGCA CTTCATCTTC 5640 



AGGGTTCATT TCACCCAAAC CTTTGTAACG TTOTAACOTG AaOCCTTTAC CAAQTTCTTT 5760

TTGCAATTTA TTAAGCTCTT CGTCTGTCCA AGCGTATTCA ACTCGCTTTG TTTTGCCTTT 5820

ACCTTTTTCC AATTTATAAA GTGGAGGTAA AGCAATAAAT ACACGACCTG CTTGAACAAG 5880

CGGTTTCATA TATTTGAAGA AGAATGTTAA CAATAGCACT TGAATATGCG CACCATCAGT 5940 

ATCAGCATCA GTCATAATAA TTACACGATT ATAATTACTA TCTTCAATTT TAAAGTCAGT 6000 

ACCAACGCCT GCCCCGATTG TGTGGATAAT TGTATTAATT TCTTCATTTT TAAAAATATC 6060

TTCTAGACGT GCTTTCTCTG TATTAATTAC CTTACCACGT AATGGTAATA TCGCTTGGAA 6120

TTTGCGGTCT CGTcCAAGTT TTGCTGAACC TCCCGCAGAA TCACCTTCGA CTAAATACAA 6180

TTCATTTTTT TCAGTGTTTT TACTTTGTGC AGGTGTTAAT TTACCAGATA GCAAAGTGTC 6240

TTTACGCTTG TTTTTCTTAC CTGAACGAGC ATCTTCACGA GCTTTACGTG CAGCTTCCCT 6300

TGCTTGTTGT GCTTTAATCG CTTTTTTCAC AAGTGATTTA GACAATTGTC CTTTTTCTTC 6360

TAAATAGAAT GGCAATTTGT CTGCAACAAC TGAATCAACA GCACTTCTAG CTTCAGAAGT 6420

ACCCAATTTA GATTTCGTTT GTCCTTCAAA TTGCAATAAT TCTTCTGGAA TACGAACAGA 6480

CACAACAGCT GTTAAACCTT CACGAATATC ATTACCATCT AAGTTTTTAT CTTTTGTTTT 6540

AAGTTCATTA ATACGACGTG CATAATCATT AAATACACGT GTCATTGCTG TTTTAAAACC 6600 

AACTTCATGT GTACCACCAT CTTTAGTACG TACATTATTT ACAAAACTTA AAATACTTTC 6660 

TGAATATTGA TCATTATATT GGAAAGCTAC GTCTACCTCT ATACCATTTG CTTCACCTGA 6720

AAATGTAGCC ACGTCATGCA AAACTTCTTT TCCTTCATTG ACATAACTAA CAAACTCTTT 6780 

GATTCCTTCT TATAATGGTA TGTCTT 6806 

(2) INFORMATION FOR SEQ ID NO: 473:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1716 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473: 

GGGGCAnAAA TTCCAATACA CTCATTACCA AATATATACA CCACTTCCTG CTACAAGTnn 60

TTTTACTTGA TCTTGGTCTT TTCCGCAGAA AGAGCATTTC CAAATTTTCT TCATCTTCCA 120

TTGAATTTAA ACATTCTTTT TACACCCCTA TTCGTTAAAG ACTATACTAG ATTGGATGTT 180

ACAATGCAAC ATATTAACAT ACAAACTTTT TGCTTAAAGA ATAGTAGCAG ATACATAAGC 240

CCtTCAACGA ACTTTGCGTT ATCTCTTAAT AAATOGATAA CTTTTTGGAT ACGAACATCA 360

TTTTTAATGA TATCAGTATT ACCTAAAGTA TTTTTGATAT CTTCAACTGA GATATTAAAT 420

TGTTTACTCA TTTTTTCTAA TTCTTTATCG ATATCTTCAT CAGTAGCTTC GATTTTTTCA 480 

GCTTCAGCGA TCGCAGTTAA AGTTAAGTTA GTTTTAACAC GTTGTTCTGC ATCGTCTTTC 540 

ATTTGCTCTC TTAATTGAGT TTCATCTTGA CCTGAGATTT GGAAGTACGT TTGTAAATCT 600 

AAACCTTGTT GTTGAATTCT TTGTGCAAAT TCAGACACCA TACGATCTAA TTCAGTATTA 660 

ACCATTGCTT CAGGAATATC GATTGTTGTA TTATCAGTAG CTTTTGTAAT CGCTTCTTCT 720 

TTTTCAACAT TTTCAGCATC TGTAGCTTTT TGTTCAGCTA AACGTTTACG TAAGTTTTCT 780 

TTGTACTCGT CTACTGTATT TGCTTCTGCA TCTAATTCAT TAGCAATTTC ATCTCTTAAT 840 

TCTGGGACTT CTTTAAATTT AATTTCGTTA ACTTTTGTTT TGAAAGTTGC TTCTTTACCG 900 

GCTAATTCTT CAGCATGGTA TTCTTCTGGG AATGTTACGA CAACATCTTT TTCTTCGTCA 960 

ACTTTCATAC CTTCTAATTG CTCTTCGAAA CCAGGTATGA ATGAACCTGA ACCGATTTCT 1020 

AAATCGTAAC CTTCAGCTTG TCCACCTTCG AATTCTTCtC CGTCAACTGA ACCACTAAAG 1080

TCGATGTTAA CTGTGTCGCC ATTTTCAACA ACACCATCTT CTTTAACGAC CATTTCAGCT 1140

AAATGTCCTA AGCTGTGGTC AATCGCTTCT TGTAACTCAT CATCAGATAA TTCAGTTTCT 1200

TGTTTTTCAA TTTCAAGACC TTTATAGTCT CCTAATTTAA CTTCTGGCTC AACTGTAACT 1260

GTTGCTTCAA AAATGAAATC TTTACCTTTT TCAATTTGAG TAACACTTAC TTCtGGTTGT 1320

GCAACTGGTT TAATATCAGT TTCGTCAATT GCTTCACCAT AAGCATCTGG TAATAAAATG 1380

TCGATAGCAT CTTGATATAA TGCTTCTACA CCAAAGCGTT GTTCAAAAAT TGGACGTGGC 1440

ACTTTACCTT TACGGAATCC AGGTACGTTA ATTTGTTTAA CCACTTTTTT GAATGCTTGA 1500 

TCTAACGCTT TGTTTACTTT TTCTGCAGGA ACAGTAACAG TTAATAAACC TTCGTTACCT 1560 

TCCTTTTTTT CCCAAGTTGC TGTCATGTAT ATATACCTCC ATGATTAACT AATTTATTTT 1620 

TTCAACTTCC CTATTATATC ATACGTCTAT TCCCTATACA AACATTGAAA TCACAACGTT 1680 

TATATATTTG TAAATCAACT TTTTTCGTCA AAACTA 1716 

(2) INFORMATION FOR SEQ ID NO: 474:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 795 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

TGACCAAGTA CGTTTCGAAG TTGCCATTAA AGCATTAAAC CCATCATTGA AAGCATTCGC 60

ACCTGTACGT GAGTGGGCAT GGAGTCGTOA AGAAGAAATC GATTATGCAA TTAAACATAA 120

TATCCCTGTA TCAATCAACC ATGATTCACC TTATTCTATC GATCAAAATC TATGGGGCAG 180

AGCGAATGAA TGTGGTATTT TAGAAGATCC TTATGCTGCG CCACCAGAGG ATGCGTTCOA 240

TCTAACAAAT GCTTTAGAAG AAACACCAGA TACTGCTGAT GrAATCATTT TAACGTTTGA 300

TAAAGGCATC CCAGTTCAAA TTGATGGCAA AACATATGAA TTAGACGATT TAATTTTAAC 360

GTTGAATGCA TTAGCTGGTA AGCATGGTAT CGGAAGAATT GACCATGTAG AAAATAGACT 420

TGTAGGTATC AAATCAAGAG AAATTTATGA GGCACCTGCT GCAGAAGTTA TTTTAAAAGC 480

GCATAAAGCA TTAGAAACGA TTACGTTAAC GAAAGATGTC GCACACTTTA AACCAATCAT 540

TGAGAAGCAA TTTGCTGAAC AACTATACAA TGGACTTTGG TTCTCACCTT TAACTGATAG 600

CTTGAAATTA TTTATTGATA GTACTCAGCA ATACGTAAGT GGTGATGTCA GAATTAAATT 660

ATTCAAAGGT AATGCCATCG TGAATGGTAG AAAATCACCT TACACATTAT ATGATGAAAA 720

ATTAGCAACT TATACAAAAG AAGATGCATT TnATCAAGAC GCTGCTGTTG GCTTTATCGA 780

TATCTATGGT TTACC 795



(2) INFORMATION FOR SEQ ID NO: 475:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 887 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475:

CGATTGAAAG AAGACGGTTC AGTTGAAAAG TTTCCAAAGC CAGTAATTAG CCAACAACCA 60

GAAGGATATA CGAGTCATTT TAGAGATCCT AAAGTTTTTA AATATGATGr GaAATATTAT 120

GCAATCaTTG GTGCmCaAAA TAATGATCaG CaAGGTCGAT TATTACTTTA TAATACTGAA 180

GATATAATTA ATTGGCATTA TTTAGGTGAA ATAAATACAG AGTTGGATGA TTTTGGATAT 240

ATGTGGGAAT GCCCAGATTA CTTTAATGTA GATAATCAAG ATGTCATACT TATTTGTCCA 300

CAAGGTATTG AACCAAAAGG CGATCAGTTC AAAAATATTT ATCAAAGTGG TTATATACTT 360

GGAAAGTTTG ATATTGAAAA GTTAACATAT GAACATGAAA ATTTTGTCGA GCTTGATAAT 420

GGTTTTGATT TCTATGCACC TCAAACATTT TTAGATGAAA AAGGCCGACG AGTACTAATT 480

GGATGGATGG GGTTACCGGA AATCGAATAT CCTACTGATA ATGAAGGATG GGCCCATTGC 540

GCGTTGGAAA AATTACGTCA CAATAAAGAG ACAGCATTtA GGCtACGCAA ATAAATTTAC 660

TCGAAAATTA CATCCGTATG AAGGTAAACA GTATGAATTA ATCATAGATA TTTTGGATAA 720 

TGATGCTACC GAAGTGTACT TTGAATTACG TACATCTAAG ACTTCTTCAA CATTAATTGC 780 

TTATAACAAG CGTGAAAATA AAATAACATT AGATCGCAGC GACAGTGGTT TATTGCCGAC 840 

AAATGTTGAA gGTACGACGC GTAGTACGAT ATTAGACACG CCATTAA 887 



(2) INFORMATION FOR SEQ ID NO: 476:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1183 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476:

TTGGAAnCAA AAAACCATTG GTAAACCGTG TTnAACCGGA TTTCCGATGG ACCTTTTAAA 60

ACnACCAAAT AGAAAGCTTT GATAAAAGGT AATTATGGTA CTGATAACAA ACAAGTTCAA 120

AAACATCATG ATTTAGTACG TATGCTTTTG ATGGATCAAG ATGGTTTTTT AACTGAAAAT 180

AATAAAGTTG ATCATTTCAT TGATGGAAAT GATTTATATG ATCAAGTTTT AAAAGATATT 240

AAAAATGCAA AAGAtATATC CATTTAGAGT ACwATACTTT CGCTTwAGAT GGTTwAGGTA 300

AAAGAATTTT ACATGCTTTA GAAGAAAAAT TGAAACAAGG TCTAGAAGTA AAAATATTAT 360

ATGATGATGT TGGATCTAAA AATGTTAAGA TGGCAAATTT TGATCATTTT AAATCGTTAG 420

GTGGAGAAGT TGAAGCATTT TTTGCTTCAA AATTACCGTT ATTGAATTTC AGAATGAATA 480

ATAGAAATCA TAGAAAAATC ATCGTAATCG ATGGTCAACT AGGTTATGTC GGAGGATTTA 540

ACATTGGTGA TGAATATCTm GGATTAGGAA AATTAGGATA TTGGAGAGAT ACGCATTTAC 600

GTATACAAGG GGATGCGGTT GATGCACTGC AGTTGCGATT TATTTTAGAC TGGAATTCGC 660

AAGCGCACCG TCCACAATTT GAATATGATG TTAAGTATTT CCCTAAAAAG AACGGACCAT 720

TGGGCAATTC ACCAATTCAA ATAGCTGCAA GTGGCCCGGC TAGTGACTGG CATCAAATTG 780

AATACGGTTA TACAAAAATG ATTATGAGTG CAAAGAAATC TGTATATTTA CAATCACCAT 840

ATTTCATTCC GGATAATTCA TATATaAATG CCATTAAAAT TGCTGCTAAA TCAGGTGTAG 900

ATGTACATTT AATGATTCCA TGTAAGCCAG ATCATCCATT AGTATATTGG GCGACATTTT 960

CAAATGCCTC TGACTTATTA TCAAGTGGTG TTAAAATTTA TACGTATGAA AATGGATTTA 1020

TACATTCTAA AATGTGCTTA ATTGATGATG AAATCGTATC AGTGGGCACA GCAAATATGG 1080

CTAAAGATTT AAGGGTGGCT TATGAACATG ATATTACAAA ATC 1183

(2) INFORMATION FOR SEQ ID NO: 477:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2332 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477:
GGTTATATGT TTGQTATTAC TCATTATATT GATTGGAGGT TGTGTCATTA TGACAAAAAC 60

AAATGGTCGA AACGCTCAAA TTAAAGAAAA TTTCAACAAA ACATTAAGTG TATATCTAAC 120

CAAGAATCTC GATOATTTTT ACGATAAGGA AGGTTTTCGA GATCAAGAAT TTGATAAAAG 180

AGATAAAGGG ACTTGGATTA TTTATTCTGA AATGGTTATC GAACCAAAAG GGAAnAATAT 240

GGAATCGAGA GGAATGGTGC TCTATATCAa TCGCAATACT AGAACmACGA AGGGTAATTT 300

TATTGTCACC GAAATAACTG AAGATAGTAA AGGATATTCA CGTAGTAAAG AAAAAAAATA 360

TCCTGTCAAG ATGGAAAATA ATCGAATTAT TCCAACAAAG CCTATACCGG ATGACAAGTT 420

AAAAAAAGAG ATTGAAAACT TTAAGTTCTT TGTACAATAT GGAAATTTTA AAGATTTTAA 480

AGATTATAAA AATGGTGATA TTTCATATAA TCCTAATGTG CCAAGTTATT CTGCAAAGTA 540

TCAATTGAAT AATGATGACT ATAATGTTCA ACAGTTAAGA AAACGATATC ATATTCCAAC 600

CAAACAAGCG CCCGAATTAA AATTGAAAGG ATCCGGCAAT TTAAAAGGCT CATCCGTAGG 660

ATCTAAGGAT CTAGAATTTA CGTTTGTAGA AAATCAAGAA GAGAATATCT ATTTTTCAGA 720

TTCGGTCGAA TTTACACCTA GCGAGGATGA TAAATCATGA GTCAAACGGA ATATCAAATA 780

AAATCTGGCA ATATAAAAGG TAACTCTGAA GAAACAAGTA CAGTATCTAA TATAAGTTAT 840

GAAATAGAAA ACGCAAATAA CAGTGGTTTA AAACAAAATA AAATTGATAA ACAAATTAAA 900

AAGTTACAAG AAAAAAATAA ATTCCCTAAA AATCTTTCAT ATCTTAAAAG TTATACGGAC 960

CCCAAAACAG GCACGACTAC AAGCGCCTTT TTAAATAAAG ACACTGGCAA AGTTACTTTA 1020

GGTATGACAG GTACTAATGT ACACAAAGAC GCAATATTAA AACAAACATT TGGTGTTCCT 1080

TCTTATCAAG GATATATAGA CGTGAGTGAA ACgCTaAAAG ATATTGGGGC CGATGTCAAT 1140

ATTGGCCTTC ATTCCGTCAC AGATAAAGAT CCACATTATA AAAATACCCA AGACTTTATC 1200

AAAAATATCA AAAAAGACTA TGATATTGAT ATTATTACCG GACATTCGCT GGGCGGTAGA 1260

GATGCGATGA TTTTAGGTAT GAGTAATGAT ATTAAACATA TCGTTGTGTA TAATCCAGCT 1320

ATTGAAAAGT ACGATGGTCA CATTGTAAGA TTTGTGTCTG ATGAAGACGA ATTAGATGCA 1440

GGTGTCCGCA ATCATTTATA TGAAACTGCT GGAGAAAAAA TAGTACTTAA AAATGGAGAA 1500

GGCCATGCAA TGAGTGGTAT TTTAATGAGC AGAACACAGG CTATAATCTT AGCTGAATTA 1560

AACAAAGTTA AAGGCTACCA AGACGAAAAT AATAAAGCAT TAAAATCCGT TCGTAAACAA 1620

ACGAGGCATA GATTACATAA AGTAGAGACG TTAAGAGCGA ATTGGATTCA AACAACGGGT 1680

GGATCACTCT CTTCCTCCCw ACAACAATTA TTAGAAGCTT TAACAGCACT AACCATTGCC 1740

GAAGGCTTAA ATCAATTAGT GAATGAAGAA AGCCAACATT TGAAAAAATG TATCACGCGA 1800

TGGCACATAA ATTTGGAGAC AACTGGAAAA AAGCGCAAGA AGTTGGAAAT GAAATTGOTG 1860

AAAAATTAAC CTCTGAAGAG GTTATAGATG rATTAAOAAA AGGTGGCGCG TATGAAAGTa 1920

AACTTGAAAC AGATCCCAAA AGAAAAATTG ATGATAAGAT AAAGAAATTA AATGATGTTT 1980

ATAAAAATTG TAATGGCTAT ATCGCAAAAA TTAAACAGAG TATCGAAGCA ATTGTTTcrrA 2040

ATGACCAAAT GTTAGCGAGC CAGATTGATG GGATGATGTA ATGTTTACTA CGTATAAnAA 2100

TATTAATGAA CTTGAAAATG CCTATGATGA AGAAAGAAAA CAATTGAATG ATGCATTCAA 2160

TCAAATTGAT GAATTAAGAC ATCAAACACG CAAGAmATGT GAACAAATGT ATGATCATTT 2220

CTTATATCTC AAACATAAAA TGAATTmyms TGAAGACGCT ATGATCAGGA TGACACGTAT 2280

TATAGAATCT TTCGATAGAG AAACGAATCA ACGTATCCGA CATCACGAAA TG 2332

(2) INFORMATION FOR SEQ ID NO: 478:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 865 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478:

TTTACTACCC AGTATCTCTT TTTATAAATT ATATAGCCAC CACATATGGT GGAAAGTCTT 60

TTTAATTAGA ATTTTGTTTT TTCAGTTAAG AAAGCTTCTA ACTCTGAGAT TGGCATACGA 120

ACTTGTTCCA TTGAATCTCT GTCACGTACT GTAACTTGAT TATCTTCTAA TGAATCAAAG 180

TCGAATGTtA CACAATAAGG TGTaCCGATT TCATCTTGAC GACGGTATCT TTTACCGATA 240

GATTGTGATT CATCGAAATC GATTGAGAAT TTAGAACTTA ATTGCTCAAA AATCTTAATC 300

GtTCGCCAGA TAATTTCTTA CTTAAAGGTA AAATCGCTGC TTTATATGGT GCTAATGCAG 360

GATGGAAGTG TAAAACTGTA CGTGCATCTT TACTACCTTC AACGCCTTCT TCATCATATG 420

GAATATATTT TTCGTTCQTT TCTGOATCAT GGTATCTGAA ATCTTCACCA OAOTOTTCAG 540

CATGTTTACG TAAGTCOAAG TCTGTACGAC TTGCGATACC CCATAACTCA CCCCAACCAA 600

ATGGGAATTT ATATTCAATA TCAGTTGTTG CATTTGAGTA ATGAGATAAT TCATCTTCAT 660

CATGATCACG TAAACGCATA TTTTCACTGC TCATATTTAA GCTTGTTAAC CAGTCACTTO 720

CAAAAGTTTT CCAATAATTT TGCCATTCGA TTTCTTCTCC AGGTTTACAG AAGAATTCAA 780

GTTCCATTTG TTcAAATTCT CTTGTTCTGa AAATGaAGTT ACCTGGAGTG aTTTCaTTAC 840

GGaATGaTTT ACCAATTTGG ACCGG 865



(2) INFORMATION FOR SEQ ID NO: 479:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1444 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479:

TAATGAGTAA ATAAGTACCA GATAAAATCA TGAATATCAT CCAACATGCG GTTAACTCTA 60

CTAAATAATT AATAATAGTA TTTTCAGTAA ACAAAGAACT ATGTATACTT CGCATCACAT 120

TAGAATACGT ATGTTTCGCA TTTTGATCTG CAACAAATTG ATTGTTATGA TCTAGAAAGA 180

CGTAACGTTG ATTTCCTGCC ATATCACTCA GTGTAATTCG TTTGTTATAT GGTTCATCAA 240

GTATGCTAAC TTTACTTACA AAGAATCCTT CATATTGTTG TTCAACTTGA TGTACTGCAT 300

CATTTAATGT TTGATGCGTT TTTACATCAC TGTCACCAAA AAACTCATTC TTATAAATAT 360

TATTTTCAAC TTCTGGAAAG AACAAGTAAC CAATGCCCGA AATGGTTAAA GTGATTAACA 420

GTGGAGCAAT AAATATTGCT GCATAGAAAT GTAATCTTTG TAATGGATTA AATGTATTTT 480

TCATATTTCC CTCCCAATTG GCTATTATAC GGTGTCAATT CTGTGATGTG TGTGAACAAA 540

CTGTGACAAT ATTTATTTTC TAGAAAAATT TAACGATGAT TTGTGATTTT TAGAAAAATG 600

AACTTTTAAG TTGGAATGTT TGAAGAAAAT TGATTATTCG TATGTTTTAT CAAGCAGCTA 660

TGATAAAATT TAAACATAAT ACAATGCGAG CCATTTAACG ATCTATGTTT AAATGGACAT 720

CGATATTGTA TGAATTCGTT GTAACAAGCA AGCATTTCTA TGTGAACGAA CCAAAGGGGA 780

AAGTAACATG ATTAATAAAG AACAATTAGA TCTTTTATAT AAATTAAAAA AAGAAGTTGA 840

AAAGTCGCGA AATGAAGCAC TTTTACATAC AATTAACCAA GTAATTAAGA AAGTATATTT 900

GCAGCAATAT ACATGTTCGT TCGTTGGACA TTTTTCTGCA GGTAAATCGA CACTGATAAA 960

TATTGTGTCA GTTTCAGACA ATCACGATAT TATTGCTAAT TTGCCGAATC AAACGTATGC 1080

CAAATTATCT AATTATQATO AAGTAAGGOA AATGAATCGC CAAAATGTCG ACGTTGAATC 1140

TGTAGAAATT AATTTTCAAT CAGCTAAATT TGAAAATGGG TTTACGTTGC AAGATACACC 1200

AGGTGTTGAT TCAAATGTTG CATCACATCA OTCAATAACA GAACAATATA TGTATACAAG 1260

TAATATGATA TTTTATACGG TTGACTATAA CCACGTTCAA TCTGAACTTA ACTTTAAGTT 1320

TATGAAGCAT ATAAATGATG TTGGaATACC TGTTGTGTTT ATCATTAATC AAATTGACAG 1380

CATCCAAGAC GATGGAATTG TCATTCTCTA CGTnTTAAAT CTCGAGTTGG AAAAATCAAT 1440

TGGC 1444



(2) INFORMATION FOR SEQ ID NO: 480:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6309 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 480:

GCAGCAGCTT ATCGTGAGTT ATCATTATTA TTACGTAGAC CTCCAGGTCG TGAAkCATAC 60

CCAGGTGACG TATTCTACTT ACATAGTAGA tTATTAGAAA GAGCAGCAAA ATTAAACGAT 120

GACTTAGGTG GCGGTTCAAT TACTGCATTA CCAATTATCG AAACACAAGC TGGTGATATT 180

TCAGCTTATG TACCAACAAA CGTTATTTCA ATTACAGATG GACAAATCTT CTTACAATCT 240

GATTTATTCT TCTCAGGTGT AAGACCAGCG ATTAATGCCG GACAATCTGT ATCTCGTGTT 300

GGTGGATCTG CACAAATTAA AGCAATGAAG AAAGTTGCTG GTACGTTACG TCTTGACTTA 360

GCGTCATACA GAGAACTTGA ATCATTTGCA CAATTCGGTT CAGACCTTGA TGAATTTACT 420

GCAAGTAAAT TAGAACGTGG TAAACGTACT GTTGAAGTCT TAAAACAAGA TCAAAACAAA 480

CCATTACCAG TCGAACACCA AGTGTTGATT ATTTATGCAT TAACAAAAGG ATATTTAGAT 540

GATATTCCAG TTGTAGATAT CACACGTTTT GAAGACGAGT TAAACCACTG GGCAGAATCA 600

AATGCTACTG AACTGTTAAA TGAAATCAGA GAAACTGGTG GCTTACCAGA TGCTGAGAAG 660

TTTGACACAG CAATTAACGA ATTCAAAAAA AGCTTTAGCA AATCTGAATA ATAAACAAGT 720

TTAGTATAAG GTGGTGAGAT AGTGGCTTCT CTTAAAGAAA TAGATACTCG AATAAAATCA 780

ACCAAAAAAA TGAAGCAGAT TACGAAAGCG ATGAACATGG TATCAAGTTC AAAACTTCGT 840

AGAGCTGAAA AAAATACAAA ACAATTCACA CCATATATGG ATAAAATGCA AGATGCAATT 900




ACTAGAAGTG GATATTTAGT TATCACGAGT GATAAAGGTT TAQCAGGTGC ATATAGTGCA 1020 

AACGTGCTTA AAAAATTGAT TACTGATATT GAAGCGAAAC ATCAAGATAG TAGCGAATAC 1080

AGTATTGTAG TTTTAGGGCA ACAAGGTGTT GATTTCCTAA AAAATAGAGG TTATGACATT 1140 

GAGTATTCTC AAGTAOACOT ACCTGATCAA CCTTCTTTCA AATCTGTTCA AGCACTAGCT 1200 

AACCATGCTA TAGACTTATA CAGTGAAGAA GAAATTGATG AATTAAATAT ATACTATAGT 1260 

CATTATGTCA GCGTTCTTGA AAACAAGCCT ACATCTAGAC AAGTATTACC ATTATCTCAA 1320 

GAGGATTCTA GTAAGGGGCA TGGTCATTTG TCTTCTTATG AATTTGAGCC AGATAAAGAA 1380

TCTATCTTAA GTGTAATCTT GCCTCAATAT GTTGAGAGTT TGATTTACGG AACAATATTA 1440 

GACGCAAAAG CAAGTGAGCA TGCAACACGT ATGACTGCGA TGAAAAATGC CACTGATAAT 1500 

GCAACTGAAC TTATTGATGA CTTATCATTA GAATATAACA GAGCGAGACA AGCAGAAATT 1560 

ACGCAACAAA TTACTGAAAT TGTTGGTGGT TCCGCAGCGC TTGAATAATA TTTAAAGGAG 1620 

GAAAATAGCA TGGGAATTGG CCGTGTAACT CAAGTTATGG GTCCTGTAAT TGATGTTCGA 1680

TTTGAACATA ACGAAgTTCC TAAAATTAAT AACGCCTTGG TTATTGATGT GCCTAAAGAA 1740

GAAGGTACAA TACAACTAAC ATTAGAAGTT GCGCTGCAAT TAGGTGACGA CGTTGTTCGT 1800

ACAATTGCGA TGGATTCAAC TGATGGTGTC CAAAGAGGCA TGGATGTAAA AGATACAGGC 1860

AAAGAAATTA GTGTACCTGT TGGTGACGAA ACATTAGGTC GTGTATTTAA TGTACTAGGT 1920

GAAACAATTG ACCTTAAAGA AGAAATTAGT GATTCTGTTC GCCGCGATCC TATCCATCGT 1980

CAAGCACCAG CATTCGATGA ACTTTCAACA GAAGTTCAAA TTTTAGAAAC AGGTATTAAA 2040 

GTAGTAGATT TACTAGCACC TTATATTAAA GGTGGTAAAA TCGGATTGTT CGGTGGTGCC 2100 

GGTGTAGGTA AAACAGTATT AATCCAAGAA TTAATTAACA ACATCGCTCA AGAGCACGGT 2160 

GGTATTTCTG TATTCGCCGG TGTAGGTGAA CGTACTCGTG AAGGTAACGA TTTATACTTC 2220

GAAATGAGTG ACAGTGGTGT AATTAAGAAA ACAGCCATGG TATTCGGGCA AATGAATGAG 2280 

CCACCTGGTG CACGTATGCG TGTTGCATTA TCTGGTTTAA CAATGGCTGA ATATTTCCGT 2340 

GACGAACAAG GTCAAGACGT ATTATTATTC ATCGATAACA TTTTCAGATT TACACAAGCT 2400

GGTTCTGAGG TATCTGCATT ATTAGGTCGT ATGCCTTCTG CAGTAGGTTA CCAACCAACA 2460

CTTGCTACTG AAATGGGACA ATTACAAGAA CGTATTACGT CTACAACAAA AGGATCAGTT 2520

ACTTCTATTC AAGCGGTATT CGTACCTGCC GATGACTATA CTGACCCAGC GCCTGCGACA 2580

GCGTTTGCCC ATTTAGATGC AACTACAAAC TTAGAACGTA AATTAACTGA AATGGGTATT 2640

TATCCAGCCG TGGATCCATT AGCGTCTACA TCAAGAGCAT TGGAACCATC AATTGTAGGT 2700

CAAGATATCA TTGCTATCTT AGQTATGGAC GAATTATCTG ATGAAGATAA ACAAACAGTT 2820

GAACGCGCAC GTAGAATTCA ATTCTTCTTA TCTCAAAACT TCCACGTAGC GGAACAATTT 2880 

ACTGGTCAAA AAGGTTCTTA TGTACCTGTT AAGACAACAG TTGCAAACTT TAAAGATATC 2940 

TTAGATGGTA AATATGACCA TATTCCAGAA GATOCATTCC QTTTAGTTGG TAGCATGGAT 3000 

GATGTTATTG CAAAAGCTAA AGATATGGGT GTTGAAGTAT AACAATTAGG AGGAATGGAT 3060 

AATGAATACA TTAAACCTAG ATATTGTCAC TCCTAATGGT TCTGTTTACA ATCGTGATAA 3120 

TGTTGAACTC GTTGTTATGC AAACAACAGC TGGTGAGATA GGT6TCATGA GTGGACATAT 3180 

TCCAACTGTA GCTGCTTTAA AAACAGGCTT TGTAAAAGTG AAATTTCACG ATGGAACTGA 3240 

ATATATTGCT GTAAGCGATG GCTTTGTTGA AGTTAGAAAA GATAAAGTTT CAATCATTGT 3300 

TCAGACTGCA GAAACTGCAA GAGAAATTGA TGTTGAAAGA GCTAAATTAG CCAAAGCAAG 3360 

AGCAGAGTCT CACTTGGAAA ATGATGACGA CAATACTGAT ATTCATAGAG CCGAAAGAGC 3420

TTTAGAGAGA GCAAATAACC GTTTGCGTGT GGCTGAATTA AAATAGTAAA TAAAGGGTCG 3480

AAGATGTGAT TTCATATCTT CGACCCTTTT TTGAATTATA TTGATTTAAA GATACAAAAC 3540

ATGAGAGGGG GGAAGGAATT GATAAAGAAC CATTAAAGAT TTATGATGTA GTGGTTCTTT 3600

ATCATTAAAC ACAGCTAATG TGTATTTAAA AATAGGaayA CATgAGTAAA ACTCATGTAT 3660

AAGAAATACT AATTTCTAAA GAAAAAGTAT TTCTTTATGT TGGGGCCCCG TCAACTACTG 3720

CCAAATACAA CACTATAGAG TCTAGACATT GATTTATGTC CGACTCCCAA GAATAGTTTT 3780

ACTTTTTTAC AATCACTAAT AGATTGCTAA AATCAAAATT TCCTTCACCA CTATCTACAG 3840 

TCGACATTTC ATTTTTTGAA ATTATCTACA TTTTTTCATA CCAAGATATT TTATAGTTAT 3900 

GATATTTATG TAAAAAGAAT TATATAGTAA GTTAGCTTAA ACTTTACTAA AAACGGGTAT 3960 

TAAACTTTGT ATCATTATTT AAATTTTTCA TGTACAATGT AATACAGTAA TCTTATGAGG 4020 

TGATAAAATG GATTATATCG GACAATATGC AGTTATCCAT TTAGTGTTAC ATGTTGTATG 4080 

TATTTGTATT GCCTATTGGG CTTTACAATC AATTAGATTA GATCAATTTT TTAAAAAAGG 4140 

ATACGCCACT CAATTACAAG TGTGTATGAT ATTTGTTGCT ATTTTATTAG GCACTGCAGT 4200 

AAQCAATTTT ATTGTAGATT TGTTACAATA CTCGACGCAG GTAAAATATT TAATAAAATA 4260

AGTCTAACTC TATGATTTGT AATCAAAACT AGATATAATT AAATAATGAC TTAAAATAAT 4320 

TTTAAAATAG GGAAATGTAA AGTAATAGGA GTTCTAAGTG GAGGATTTAC GATGGATAAA 4380 

ATAGTAATCA AAGGTGGAAA TAAATTAACG GGTGAAGTTA AAGTAGAAGG TGCTAAAAAT 4440

GCAGTATTAC CAATATTGAC AGCATCTTTA TTAGCTTCTG ATAAACCGAG CAAATTAGTT 4500

GAOGTTACAT ACAAAAAGGA CGAAAATGCT GTTGTCGTTG ATGCAACAAA GACTCTAAAT 4620

GAAGAGGCAC CATATGAATA TGTTAGTAAA ATGCGTGCAA GTATTTTAGT TATGGGmCCT 4680

CTTTTAGCAA GACTAGGACA TGCTATTGTT GCATTGCCTG GTGGTTGTGC AATTGGAAGT 4740

AGACCGATTG AGCAACACAT TAAAGGTTTT GAAGCTTTAG GCGCAGAAAT TCATCTTGAA 4800

AATGGTAATA TTTATGCTAA TGCTAAAGAT GGATTAAAAG GTACATCAAT TCATTTAGAT 4860

TTTCCAAGTG TAGGAGCAAC ACAAAATATT ATTATGGCAG CATCATTAGC TAAGGGTAAG 4920

ACTTTAATTG AAAATGCAGC TAAAGAACCT GAAATTGTCG ATTTAGCAAA CTACATTAAT 4980

GAAATGGGTG GTAGAATTAC TGGTGCTGGT ACAGACACAA TTACAATCAA TGGTGTAGAA 5040

TCATTACATG GTGTAGAACA TGCTATCATT CCAGATAGAA TTGAAGCAGG CACATTACTA 5100 

ATCGCTGGTG CTATAACGCG TGGTGATATT TTTGTACGTG GTGCAATCAA AGAACATATG 5160 

GCGAGTTTAG TCTATAAACT AGAAGAAATG GGCGTTGAAT TGGACTATCA AGAAGATGGT 5220

ATTCGTGTAC GTGCTGAAGG GGAATTACAA CCTGTAGACA TCAAAACTCT ACCACATCCT 5280

GGATTCCCGA CTGATATGCA ATCACAAATG ATGGCATTGT TATTAACGGC AAATGGTCAT 5340

AAAGTCGTAA CCGAAACTGT TTTTGAAAAC CGTTTTATGC ATGTTGCAGA GTTCAAACGT 5400

ATGAATGCTA ATATCAATGT AGAAGGTCGT aGTGCTAAAC TTGAAGGTAA AAGTCAATTG 5460

CAAGGTGCAC AAGTTAAAGC GACTGATTTA AGaGCAGCAG CCGCCTTAAT TTTAGCTGGA 5520

TTAGTTGCTG ATGGTAAnAC AAGCGTTACT GAATTAACGC ACCTAGATAG AGGCTATGTT 5580

GACTTACACG GTAAATTGAA GCAATTAGGT GCAGACATTG AACGTATTAA CGATTAATTC 5640 

AGTAAATTAA TATAATGGAG GATTTCAACC ATGGAAACAA TTTTTGATTA TAACCAAATT 5700 

35 

AAACAAATTA TACCTCACAG ACAGCCATTT TTATTAATTG ATAAAGTAGT TGAATATGAA 5760 

GAAGGTCAAC GTTGTGTGGC TATTAAACAA GTATCAGGAA ACGAACCATT CTTTCAAGGG 5820 

CATTTTCCTG AGTATGcGGT AATGCCAGGC GTATTAATTA CTGAAGCGTT AcTCAAACAG 5880 

40 

GTGCGGTAGC TATTTTAAAT AGTGAAGAAA ATAAAGGTAA AATCGCTTTA TTTGCTGGTA 5940 

TTGATAAATG TCGTTTTAAA CGTCAAGTAG TACCTGGTGA TACTTTAACG TTGGAAGTAG 6000 

45 AAATCACTAA AATTAAAGGA CCAATAGGTA AAGGTAATGC TAAAGCTACT GTCGATGGTC 6060 

AACTTGCTTG TAGTTGTGAA CTTACATTTG CAATTCAAGA TGTAAAATAA AACAAAAAAA 6120 

ACATTCAAAG ATTTAATGTG TTGGCATAAT CTTTGAATGT TTTTTATTTT ACTCTTCTAA 6180 

60 TTTTTCATCC TTTAACTTTG GTTTAGACTG CaTCATTCGA TTAAATGATT TTTTTAATTC 624 0 

TTCACCAGAT AATC CAT CAT CAATAAGTTG GTTCTAATAA ACTTTCAGCA TACTGTTGGA 6300 
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(2) INFORMATION FOR SEQ ID NO: 481: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1717 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 481:

AGTTGCTACA CCAGACATGA TGGGTGAAGT TGGTAAATTA GGTCGTGTAT TAGGACCAAA 60 

AGGTTTAATG CCAAACCCTA AAACTGGAAC TGTAACAATG GATGTTAAAA AAGCTGTTGA 120

AGAAATCAAA GCTGGTAAAG TAGAATATCG TGCTGAAAAA GCTGGTATCG TACATGCATC 180

AATTGGTAAA GTTTCATTTA CTGATGAACA ATTAATtGAA AACyTCaATA CyTTACAAGA 240

TGTATTAGCT AAAGCTAAAC CATCATCTGC TAAAGGTACA TACTTCAAAT CTGTTGCTGT 300

AACTACAACA ATGGGTCCTG GAGTTAAAAT TGATACTGCA AGTTTCAAAT AATAAATGAT 360

ATAAACAATT ACAGGCTGAA AGAAATATCT TTCAGTCTGT AAAAATATAT TGACAATAAG 420

TAATTTCCAA GTTATATTAC TTATTGTGAT TATTTTACCT AAGACAGTAG GAGTTATTTA 480

TAACTTAAAA TTTATCCTGC CGAGGCTAAA ATTGACTTGA ACGTGATGAT CTATGATCTT 540

TCAAGCACTT TTTGCOGTGG GTAGAAAGTG CTTTTTTTAT TAATTTTAAA AAAAGCACCA 600

AAAATTTAAA TGGAGGTGTC TGAATGTCTG CTATCATTGA AGCTAAAAAA CAACTAGTTG 660

ATGAAATTGC TGAGGTACTA TCAAATTCAG TTTCAACAGT AATCGTTGAC TACCGTGGAT 720

TAACAGTAGC TGAAGTTACT GACTTACGTT CACAATTACG TGAAGCTGGT GTTGAGTATA 780

AAGTATACAA AAACACTATG GTACGTCGTG CAGCTGAAAA AGCTGGTATC GAAGGCTTAG 840

ATGAATTCTT AACAGGTCCT ACTGCTATTG CAACTTCAAG TGAAGATGCT GTAGCTGCAG 900

OGAAAGTAAT TTCTGGATTT GCTAAAGATC ATGAAGCATT AGAAATTAAA TCAGGCGTTA 960

TGGAAGGCAA TGTTATTACA GCAGAAGAAG TTAAAACTGT TGGTTCATTA CCTTCACACG 1020

ATGGTCTTGT ATCTATGCTT TTATCAGTAT TACAAGCTCC TGTACGCAAC TTCGCTTATG 1080

CGGTTAAAGC TATTGGAGAA CAAAAAGAAG AAAACGCTGA ATAATTTTTA GCGTAAAAAA 1140

ATTAAAAATA ATGGAGGAAT TATAAAATGG CTAATCATGA ACAAATCATT GAAGCGATTA 1200

AAGAAATGTC AGTATTAGAA TTAAACGACT TAGTAAAAGC AATTGAAGAA GAATTTGGTG 1260

TACTGCAGCT GCTCCAGTAG CAGTAGCAGG TGCAGCTGGT GGCGCTGACG CTGCAGCAGA 1320

AAAAACTGAA TTTGACGTTG AGTTAACTTC AGCTGGTTCA TCTAAAATCA AAGTTGTTAA 1380 
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TCCTAAAGTA ATCAAAGAAG CTTTACCTAA AGAAGAAGCT GAAAAACTTA AAGAACAATT 



1500 



AGAAGAAGTT GGAGCTACTG TAGAATTAAA ATAATTCAAG TATCTTAAAC TTAATAATCA 1560

AAGTTTTATA GCAAGTATTG CTATAATATA ATGATTCTTT GAGAAGTTAA AACCCCGTTA 1620

TTTTGATAAC GGGgTtTTAT TCaTTTAAAG ACTGAGTGAA ATGTTATAAT TATAATGACG 1680

AGTTACAAAG TGAAGATGAG GTGGGAATAA TGAGTCA 1717






(2) INFORMATION FOR SEQ ID NO: 482: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1279 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482:

GTAAATCTGT TACTCGAAAT GTAACGATTA AAGAAAAGGG CTCATCTCAA ACATATATTT 60 

TGTTAGGCTA TCCAACAAAA GCACAGAAGA ATAGTCATAG CAAATATAGT GGAGTCTTTA 120 

TATATAAAGA CTTGAAATCA ATCGAAGATA CAAATAATGC TATTACGATT ATCACCATAA 180

TTACGGCTGT TATTTTCTTA ACAATTACAA CAGTCTTTGC GTTTTTCTTA TCGTCAAGAA 240

TTACAAAACC TTTAAGACGT TTAAGAGACC AAGCTACACG TGTATCTGAA GGGGATTACT 300

CTTATAAACC TTCTGTCACA ACGAAAGATG AAATTGGTCA ATTATCGCAG GCATTTAATC 360

AGATGAGTAC AGAAATCGAA GAGCATGTCG ACGCATTATC CACATCTAAA AATATTAGAG 420 

ACAGCTTAAT TAACTCTATG GTAGAAGGTG TCCTAGGTAT TAATGAGAGT CGACaAATTA 480 

TCTTATCTAA TAAGATGGCG AATGATATTA TGGACAATAT TGATGAAGAT GCTAAAGCTT 540

TCTTATTAAG ACAAATAGAA GATACTTTTA AATCAAAACA AACTGAAATG CGCGATTTAG 600 

AAaTGAATGC ACGATTCTTT GTTGTGACCA CAAGCTATAT CGACAAGATT GAACAGGGAG 660 

GTAaAAGTGG TGTTGTTGTG ACAGTTCGTG ATATGACTAA TGAGCACAAT CTAGATCAAA 720 

TGAAGAAAGA TTTCATTGCT AATGTATCAC ATGAATTACG TACACCGATA TCATTACTTC 780 

AAGGTTATAC TGAATCAATT GTAGATGGTA TTGTTACAGA ACCGGATGAA ATAAAAGAAT 840 

CGCTTGCCAT TGTCCTTGAT GAATCGAAAC GTTTAAATCG TTTAGTTAAT GAATTGTTAA 900 

ATGTCGCACG CATGGATGCT GAAGGGTTAT CCGTAAATAA AGAAGTTCAG CCTATTGCAG 960 

CGTTACTAGA TAAGATGAAA ATTAAGTATC GCCAACAAGC TGATGATTTA GGTCTAAATA 1020 

TGACTTTTAA TTAyTGTAAG AAGCGTGTTT GGAGTTATGA TAwGGATCGC ATGGACCAAG 1080
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TTACTTGTGA TGAAAATGAA AGCGAAGATA TTTTATACAT TAAAGATACA GGTACAGGCA 1200 

TTQCACCAGA ACATTTACmA CAAGTATTTg ATCGTTTTTA TAAAGTTGAT GCAGCGnAnA 1260 

ACCCCGnGGT AACCAnGTA 1279

(2) INFORMATION FOR SEQ ID NO: 483:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1144 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 483:





GAGCTGTTGT TACTTTGATG CCTGCAGCTT TATTACGGCT GACTTGGTAA TGATAAGTTT 60

CAGCATATTG CTCAATATAT GCTATATCAT ATTGAATGGT ACGAGGTGAT ACACCAAGTT 120

GATTAGCAAT GGTATTGATT GGAATAAACG TTTGCTCATG AATTAAAAGA TACAAAATTT 180

CGATTTGTCT ATAACTTAAC AACGTAATAT CCTCCTATTT GTAATTGTAA GCGATTTCTT 240

AAAAACGTAG ATATGCAATC TCTTTCATAT TTTAATCCGA AAAATTGCAT ATCAAAATGT 300

TTATGGCGCA AGATTTTATA GGAACTTTTA AAATAAATTA rATATTCATG TTGACAATTT 360

AAAAATGTCG CAGTATATTT AGTTAGACAT CTAACGAAAT GGTGGTGCAA TAAATGGAAT 420

TCACTTATTC GTATTTATTT AGAATGATTA GTCATGAGAT GAAACAAAAG GCTGATCAAA 480

AGTTAGAGCA ATTTGATATT ACAAATGAGC AAGGTCATAC GTTAGGTTAT CTTTATGCAC 540

ATCAACAAGA TGGACTGACA CAAAATGATa TTGcTAAAGC ATTACAACGA ACAGGTCCAA 600

CTGTCAGTAA TTTATTAAGG AACCTTGAAC GTAAAAAGCT GATCTATCGC TATGTCGATG 660

CACAAGATAC GAGAAGAAAG AATATAGGGC TGACTACCTC TGGGATTAAA CTCGTAGAAG 720

CATTCACTTC GATATTTGAT GAAATGGAAC AAACACTCGT ATCGCAGTTA TCTGAAGAAG 780

AAAATGAACA AATGAAAGCA AACTTAACTA AAATGTTATC TAGTTTACAA TAAATGATAA 840

GTGTGACTGG TAGAAATCAG TCACTTTGTC TTTAATATTA TAGTTAGATA TCTAATTGTT 900

AGTAAGCTAA TTATTGGAAA AGACAAGGAG TATTGAACAA TGAAAGACGA ACAATTATAT 960

TATTTTGAGA AATCGCCAGT ATTTAAAGCG ATGATGCATT TCTCATTGCC AATGATGATA 1020

GGGACTTTAT TAAGCGTTAT TTATGGCATA TTAAATATTT ACTTTATAGG ATTtTyAGAm 1080

GAyAGCCACA TGATTTCTGC tAatCTCTCT AACACTGCCA GTATTTGCTA TCTTAATGGG 1140

GTTA 1144
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1X58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 484: 



ACAACCAATT 


TTACTAAACA 


TGGTrrTCAT 


gCATTTTCAA 


TATTTAGATG 


TCGACCGCAT 


AATCGAAGAA 


TCGCCGACAA 


TAGTACTTAT 


CGATGAGTTA 


GCACATACGA 


ATATTTCTAG 


AGATCGTCAT 


GAGAAACGAT 


ATATGGATAT 


TGAAGAAATT 


TTAAATCATG 


GTATOGATGT 


TCATACCACT 


TTGaACATTC 


aaCATATTGA 


AAGTTTAAGT 


AGTCAAATTG 


AACTGATGAC 


CGGTGTACAT 


GTTAAAGAAC 


GTGTACCCGA 


CTATTTCATA 


ATGAGCGCCG 


ATGTATTAGA 


AGTCGTAGAT 


ATCTCACCTG 


AACAATTAAT 


TAAACGCTTA 


AAAGCTGGCA 


AGGTATATaA 


AAAGGATAGG 


CTAGATGTAG 


CATTTAGTAA 


TTTCTTTACG 


TATGCCCACC 


TAAGCGAAtG 


CGTACATTGA 


CGTTAAGAAC 


AGTTGCCGAC 


TTGATGAGTG 


ATAAAGAAAA 


AGTCCGACAC 


AACCATAAAA 


CGTCACTCAA 


ACCTCATATT 


GCTGTGGCAA 


TTAGTGGGAG 


CATTTATAAT 












Ul 1 UAL 1 Ul* 1 


ATTTATATAG 


ATGTATTCGA 


AAAAAACAGG 


CAATATAAAG 


ATAGTCAAAA 


GCAAGTG CAT 


CAACATCTCA 


TGCTTGCAAA 


ATCATTAGGA 


GCAAAAGTAA 


AAGTAGTTTA 


TAGCCAAACC 


GTTGCATTAG 


GATTAGACGA 


ATGGTGTAAA 


AATCAAGATG 


TAACCAAATT 


AATTATCGGA 


CAACATATTA 


GAAATAAGTG 


GCGAGACTTT 


TTCAATACAC 


CTTTAATTGA 


CCATTTAATG 


TCCTTTGAAC 


ATAGCTATAA 


AATCGAAATC 


GTTCCAATCA 


AACAAATACC 


TGTTGAATTG 


AAAATGAACA 


AATCACCCTA 


TCGTCCTAAA 


GGCAAACGTT 


TCGCCATAGA 


TATGTTAAAA 


ATGATTTTGA 


TTCAAATAAT 


TTGTGTAATG 


ATGGGACTGT 


GGATTTATCA 


ACTTGATAAG 


CATGAGTCTA 


GTACGATTAT 


TTTAATGATT 


TTTCTCATCG 


GCATCATTTT 


ATTATCCATT 


TGGACGCGGT 


CCTTCATCAT 


TGGCTTTTaG 


CAGCAATTAt 


TAACGTATTT 


GTgTkTAATT 


ATkTTTTtAC 


GGAACCTA 











(2) INFORMATION FOR SEQ ID NO: 485:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2224 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 485: 





ATCATATGGT CGATTTAACA GATCCAACGT ACTGCtAAAT AATTACATQA CGTTTAAACA 60

TAGCATTGAT TATAACTATT TCTAAGTCTT CGCATTATTT GCGATGATGT GGGAATAGTT 120

ATTTTTATTT AAAAATATAA AAAAATAGAT GCAGCAAAAT TTTAAAGCAT TTTATTTTGA 180

ACATATTAAA AGGGAGCGTA TCATAATGGA ATGTAATGTT TATATCGTAT GCATTACGGA 240

TAAATAATAT ATAAATCATT CTTGAGGAGT GAAAGAATAA TGAGAGACTA CACAAAOCAA 300




TACATTAATG 


GCGAATGGGT 


AGAAAGTAAT 


AGTAATGAAA 


CG ATAGAAGT 


TATA A ATPPA 


i fin 


IS 


GCAACCGAAG AA6TAATCGG 


GAAAGTTGCT 


AAAf^fTTAATA 


#V\\9l> X X w X 


Tf2 AT A A AflfT* 


^ *y a 


GTCGAGGCGG 


CAGACGATGT 


x xnx a. inwuj 


TTPrviTPATA 


P ATPTGTGA A 
V«r\ iwiu iwln 


Art & A lift & Pft ft 


Oft 




GCG 'A*i ~rA CZ 




AAAAGAmTAT 
Annnunlll x nx 


rs a n a arii^a a 


A Aft AfY5 ZkTAT 




S40 


20 


ATTACGGATG 


AATTAGGTGC 




X XAXl.l\aAv\~ 


PTVPPP ft IT ft 


ft ft ft qyv^i"! lv 

TCaAaTGGGA 


€00 




CTAAACCATT 


TTGTTGCAGC 


rtArtAftAf*rtf*A 


TTAGATAACT 


ACGAATTTGA 


ftAft ft 


660 




GGAGATGATT 


TAGTTGTTAA 


ft <T» ft ftfl/"*ft ft TV' 


GGTGTATCTG 


t~* ft IT ft ATWPft f» 


accgtggaac 


720 


25 


TTCCCTACAA 


ACCAAACATC 


* *p*t»b » ft ft •t**t , r 


L-AIj<_A 1 


TTGCGGCTGQ 


tagtccagtt 


780 




GTACTTAAAC 


CATCTGAAGA 




GGAGCTGTTA 


TTTTAGCTGA 


GATTTTTGAT 


840 




AAAGTCGGTG 


TTCCTAAAGG 


TY2T fiTTr A ftp 


l«X XViX lAnlu 


Vj I \sA XVJV3 Ivrv. 


TGGTGTTGGG 


900 




AATCCTTTAT 


CTGAACATCC 






1 1 AvJAvjVjA XV. 


AGGCC ctact 


960 




GGTTCTAAAA 


TTATGGAAAA 




VjAI X X X.MAnn 


AuulnlCAl X 




1020 




GGCAAATCAC 


CATATATCGT 




RTARATATTA ; 


AArtAArtPrtrtr* 


X nnnuLAALa 


1080 


35 


aCAGGCAAAG 


TTGTTAATAA 


TACTGGTCAA 


GTATGTACAG 


CTGGTACACG - 


. AO A X.X 


1 1 A ft 




CCTAACAAAA 


TTAAAGATGC 


ATTCTTAjGCT 


GAATTAAAAG 


A A fTAATTT Af» 


rT!AAft*IY«r , rtT 


t *>nn 
1/UU 




GTCGGTAATC 


CAAGAGAAGA 


TGGTACACAA 


GTAGGCCCTA 


TCATTAGTAA 


A A R RPJMiTT*!' 
nnnnAMnn^X X X 


1 OCA 


40 


GATCAAGTAC AAAATTATAT TAATAAAGGT ATTGAAGAAG GTGCTGAATT ATTTTATGRT 1320

GCTCCTGGTA AACCAGAAGG ACTTGAAAAA GGATACTTTG CAOGTCCGAC AATTTTTATT 1380

AATGTAGATA ATCAAATGAC GATAGCACAA GAwGAAATTT TTGGGCCAGT AATGTCAGTT 1440

ATCACTTATA ACGATTTAGA TGAAGCGATT CAAATTGCAA ATGATACAAA ATATGGTTTG 1500

GCAGGATATG TTATTGGTAA GGACAAAGAA ACATTGCATA AAGTAGCTCG TTCTATTGAA 1560

GCAGGTACAG TAGAAATAAA CGAAGCAGGT AGAAAGCCAG ATTTACCATT TGGTGGCTAT 1620

AAACAATCTG GTTTAGGTCG TGAATGGGGC GATTATGGTA TTGAAGAGTT CTTAGAAGTG 1680
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AGTGCACATG ACTAATTAAG TTTTGTGTAC TGTTTTAATT TTGCAATTTT TATAAATAGA 1800 

TTTTOTAATT AAAATAAAAA TTTGCTATAG TTATTCATGT ATTTAAAAGG TTGGGGATTA 1860 

GCATAATGGG ATTGTGCTAG CACAGTTATT TATGCATTGT CATGCCTATC TATTACTTAC 1920

TAACTAAAAA ATAATGAAAT GGGTGTAAAC TATATGCCTG AAAGAOAACG TACATCTCCT 1980

CAGTATGAAT CATTCCACGA ATTGTACAAG AACTATACTA CCAAGGAACT CACTCAnAAA 2040

GCTAAAACTC TTAAGTTGAC GAACTATAGT AAATTAAATn AAAAAGAACT TGTTCTAGCT 2100

ATTATGGAAG CACAAATGGA nAAAGATGGT AACTATTATA TGGAAGGTAT CTTAGATGAT 2160

ATACAACCAG ATGGTTATGG TTTTTTAAGA ACAGTGAACT ATTCTAAAGG GGAAAAAGAT 2220

ATTT 2224

(2) INFORMATION FOR SEQ ID NO: 486:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1690 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 486:





ACATTACnTT GAATCGAAAG TTTCATAAAC GACTTGAATG CCAGTTTCTT TTTCAAATTT 60

CTTAATTAAC TCTGGATCAA TATATTCGCC CCAATTGTAT ACGTAAATTT TTTGATTTGT 120

ATGCACTTGT TCTTTAGATT TAAACCAATG ACTTAAAGTA AGACAAAGCA TACCCACAAC 180

TAATGCACCT ATAATGAGTT GTAAAAATCG TTTCATTATT TTACACCTCG CTTGATTAGT 240

TTTTTCTTAT TTATCACGTA TTGAATCAAA TAATATCCTA GTATTCCTAA TACAATAACA 300

GCAAACAATA ATGTTGAAAT CGCATTAATT TCCATACTAA TTCCTTTTCT CGCCATAGCA 360

TAAACTTCAA CTGATAACAC ACTAAAGCCA TTACCAGTAA CGAAGAAACT TACTGTGAAA 420

TCGTCTAGTG AATAAGTTAA AGCCATAAAG AATCCTCCTA TAATAGAAGG TAAAATATTA 480

GGAATAATAA TGTTGCTTAA TAATTGTGGT TCAGTCGCTC CTAAATCTCT TGCAGCATTT 540

AACATATTAT TATTCATyTC ATACAGTTGT GGTAAGACGA TAATCACAAC TATAGGTATG 600

CAAAATGCAA TATGAGATAT TAGAACTGTC CaAAAkCCTA AACCAAGACC AGTAAAATGG 660

CCAATCGTTG TAAACATAAT TAAGAATGAT GCACCTATGA CAACGTCGGA TGATACCATC 720

AAGACATTAT TCAATGTTAG TAAAGTTACT TTAAACTTTT TATTTCTTAA ATAATAAATA 780

GCAATGGCAC CAAATGTACC AATAACTGTA GAAATTGAGG CTGCTAAAAG TGCTACAGCT 840
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AATGTAAAAT 


GTTCAAAGTG 


AATCATATTA 


CCAGCCGAAT 


TGAATGAATA GAACATTAAA 


960 




AAGAATATTG GGATGTATAA AATCGCTAAA AGTATCCCGA TATACAGCTT TCCATACCAT 1020

TTCATATGAT TCACCCTCTC CCATTAGATG ATTTTGTAAT GATTAAAATG AATGCCATAA 1080

ATACAATTAA GAATATAGCT ATAGTTOATC CCATACCATA ATTTTQAATT GTTAAAAATT 1140

G1TTCCTCTAT TGCCQTACCT ATATTTATGA CTTTATTACC TGCAATTAAT CTTGTAATCA 1200

TAAATAATGA AAGTGATGGA ATAAAGGTTA CTTGAATCCC AGTCATAACA CCTTCTTTTG 1260

TTAACGGCAT GATTACTTTT CTAAAAGTAT AGAAAGGACT GGCACCTAAA TCACTTGAGG 1320


1S 


CCTGCAATAA 




ATTGCTTTCA 


TGCTATTAAA 


TATAGGTAAA 


ATCATAAATG 


1380 






ACTACTAAAA ACGCACCAGT 


TGTAAATAAC 


AAATTGAATG 


1440 




ATGGTAAATT AAATAAGTGG AAAATTGATT AATCACGCCA TCATGACTTA ATAAACCTAT 1500

AAAAGCATAT GTCTTTAACA ATAAATTTAT CCATGTTOGA ATAATCATTA TCATTAATAA 1560

GATATTTTGA AATTTOGAAC GAGTAATATA ATAGGCAGnT GGATAACTGA TAGTCAAGGT 1620

AATAATTGTT ATTOAAGCGG CATATAAAAT TGAATATGCA AACATTTTCA AATATTTTGT 1680

AGTAAAAATT 1690



(2) INFORMATION FOR SEQ ID NO: 487: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2112 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 487:



ACGAAAAGAA ATATTATGAT GAACAAAAAG AAAGAATAAC GATTTATATG AAGTACAATG 60

TGAAAGGTTA TAAAAATATA AGCTTCGCTA ATTTTAAAGA AAACCCAATG GATGGTTATT 120

CTATTAGTGG TTATATAAAT AAtGaTAAAA AGTTATCATT TACAGCTGGT ATAAGATCTG 180

TTGATGATTT TCAATTTGAT ACCGATATTT CTTATACAGA TGAATTGGGT AGAAAATTTA 240

ATAAAAATCC TAAGTCAGTT TCTGAAATAA AAAAAGAGCA AAATACGTCC AATAAATAAT 300

TGTTCATATT GTGATGAAAC AAAAATATAA GTCATTAGAT GAGTTTAACT ATGTTATAAA 360

TATTTGTAGT ATCTATAAAA ATCTCGACAC TATTAAAATG ATAAAGTGCC GAGGTTTTCT 420

TACTTATTTA GTTAATTCAA AGTTTATGCC AGATTCATAA GAATTTGTGA CACTTTTAAT 480

AGTGTAcCAT TGATTATTAC AATTTATCAA ATGGTCCTTT AGAAGGTATA AATAACAAAA 540
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TATTATTATG TTCAAAACTT TACGCTCCAA AAAGTAAAAA GGAAGTTAAG CAATGTTTAG 660 

TTGCTTAaCT TCGGaTATTG AACGCATCAG TCCAATTTGA CATAGAGCCT TTTTTAGTTC 720 

TTGATGTTTC TCTTTAAAAC CTTGCATATT TTACAAAAAG AAAAATTAGC AGTATAATTA 780

AGACAACGAA AATAAGTATT TACTTATACA CCAATCCCCT CACTATTTGC GGTAGTGAGG 840

GGATTTTTAT TGOTGCGGCT ATATGTCACC TATTTTGTAT TGCGTCTACT TAGCCAATAA 900

GAAAAAAACG CAATGGCACA GCCACTGATG ACTGGTGCTA TGATGTGAAC GaAAATAAGC 960

ATCACCTTAT ACACCTCCTC TCTGCGTCTA AATTGACGsC TGAGaGrTAG GcGACTCTAC 1020

TATTATATCA TCGGCAAATA TACAAGCACA GTCACTTGCT TCTGATAAGT TATATGATTC 1080

TAGCTGATAG ATTGAATCGT CTACACTTAA TTGGACAAAT TCTATGAGAA TAGATATTGT 1140

TAATTTAAGA AAGTAGGCGA TTTTATTATG ACAAGAGAAA GAAGATCATT TAGTTCAGAG 1200

TTTAAGTTAC AAATGGTTAG ATTATATAAA AATGGTAAGC CTAGGAATGA AATTATACGC 1260

GAGTATGATT TCACACCTTC GACGTTTGTA AATGGCGGTT ATAAAATGTA GOAAAATGGA 1320

TAAAGCAACA TCAAAACACG GGTACATTCA ATCACCAAGA TAACTTATCG GATGAAGAAA 1380

AAGAGCTGAT TAAATTACGC AAAGAAGTTC AACATTTAAA AATGGAGAAC GATATTTTAA 1440

AGCAAGTAGC GCTGATTATG GGGCAAAAAT AGAAGTCATT CAAAAGAATG CACATCAATA 1500

TTCAGTATCA GCAATGTGTA AAGTCCTGAT AATACTAAGA AGTACCTATT ATGATTCTAT 1560

AAAAAGAAAA GATAATAAAA TCACTAAAGA TGATTCAAAC ATAGAACATG CCGTCATAAA 1620

TATTTTTAAT TCTAATAGAA AAGTCTTTGG TACAAGACGA ATTAAAAATC ATTTAAATGA 1680

CAAGGGTCTC ACTGTATCTG GACAAAAGAT AGGTCGATCA TGAAAAAATC TAGTTTCTGT 1740

TTATACGAAA TCTAAATACA AAAATCATCT AAAAGAAACT AATGAAAAAC GAATTAAAAA 1800

TCTTTATTAT TAGCTGCTGG TGTATTATTT GTTAGTCCAA TTTCATTATC TTATAATTCA 1860

GATGTAGCTC ATGCTGAAGA TAAGTTAGAC CATTCTCAAG CAAAGGTAAT ATATTTGAGT 1920

AACCAAAATT TATTTOATGA ACTTGAGAAA AAAGGTTATA AACTGGAAGA TATATTTACA 1980

AAAGAAGAAA TAAAAAAATA TAAAGCTGAA GACCAATTGA GAGCGGGTAA AACTCAATAT 2040

GTAGAAACAG GTAAAGATAC TGCAACATTA TATCTTTCTT CTGCATATAC AAAAACAATA 2100

GCTGCTTTAG GT 2112

(2) INFORMATION FOR SEQ ID NO: 488: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 454 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 488: 

GTAGGCACAC ATCTGCCATA TAAACATTCT TTTATACTAG TGTTCTCATA TAGTGTAGAG 60

TTATAGTCTC CTTCTTGAAT CTOGAATAAT TCAATCAACC TATCAACCTT AGTCTCTTCC 120

GTTACTTCTT TTTCAATATC AACTATGAAG GGGATATCAA TTGGAATAAA ACTTGACGTC 180

GAACACTTAT TTGTATTTGG ATGAAAACGA ACGAATCCAT CACTAAATCC TGTTGAAAAA 240

AATATTTTTC CTTGTGATAG ATCCGOATTT TCTCGCGCCC ATTTAATTAA TTCATCTAAT 300

CTCATTTCTT TTTTAACTTT GATTTTCATT GTTATATCTC CTCTTGAACA GTAAATTTAT 360

CGTTAAtTGA TACGTATCCA GTCACATTAC ATAAGATGCT ATCAACATCA AAAGTCACAC 420

AACAGTTGCG TTCAACATCA TTTGAATAGA ATCT 454



(2) INFORMATION FOR SEQ ID NO: 489: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1372 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 489:



TTGTCAGAAT TAGAATGCTT TTGAGTTACT TCATAATACT CATCAGTTTT TTGTGTATCC 60

TTTTGACTTT TATTTATTTC TTTCCACTTA CCAGTATGAC TTTCTTTTTT TACAGTTATT 120

TTCGGTTTGT TTAAAAAATG ATAACCOATG TTCTTTTTGG TATCTGTGGA CTCTAAAAAG 180

ACTGAATTGT TTTCCTGATT ATCAGAATTG GTTGTTTGTT TATCGTCTGT ATATAACGTA 240

TACCCATTCG CTTTGCGATT TTCAATCGTT GTAACTGGAT TCTTTGATGA ATCAGTACTT 300

TTAATGCCAG TTCCTAAGAA GACAATTTTA TCGTTTAATA TGAAATATGA TT-ITTTGGCA 360

GTTAAAGTTT TGTCCTGATT TTCAAAATCC ATTCCGATAC TAGCATGTTG GTCATCAACT 420

rnurrccGc CAACAAAAGT TTTACTCGAC TTTTTATCAT CCGTATCTTT TAATATTTCA 480

TTGTCTAAAG TTGTTGTACC TGATAAACGT TTCATATCGG CTGTCACCCA GAAGTTATCA 540

TGATAGTGTT TGACATCGCT GTTATATAAA TAAGACATTC CAGCACCAGT GTGCCAACCT 600

TTTAAATTCT CTCCGTTGAT ACTTTCATAG CGTGCTACGT TTTTCGACGT CATACTTAAA 660

CCAAATGCAA AGTCTAAGTC TTTGTTATGA TAGGTGACAC GATCCATGTC ATTATATATT 720

TTAAGTTGTT GTGTTAATCC GTTTTTAGAA ATACTGTTAT CTGTCATTAA AGACTTCATT 780
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ACTGAAGATT TGACAATCTT TTTATACTTA GCTTTTGTTO AATCATCCAT GGCATCACTT 900 

AATCTCAACA ATGATTTCAT TACTGTTGCA GATGCTGAGT GACTGGTTTC ATTTTCACGA 960 

CTGATAGCTC TACCTCGTOA TAAATCCATC ATTTCACCTT TATAAATOAG TGGCATAAAT 1020

CCGTCGTCAA TCCATGACTT TAAGGTTGTA TCATTTTGGG TTTTATCATT AAAAGGTGTT 1080

TCTTTTATCA TCGGCATCAT TTGAGAAATC CCCTCTAAGA GTACAACGCC ATAAGCACCA 1140

GTGTATGGAA CGTCTTGATG ATCAATGTAA GAGCCATCTT TATAAAATCC ATTACGTTCT 1200

TTACCAGTGG CAGAATCTTG AACGTAAGTG AAGACTTTAT TAAATGAATC TATAGACTTT 1260

TTCATCATAT CTTTATCTTC TTCGATAATA CATTCTAAAA GTTTCACCTT AGAAATGTCT 1320

ACTAnATTTC CGCCTTTAGC AAGTTCAGnT TTTCCTACAC AAGATAATAT TT 1372

(2) INFORMATION FOR SEQ ID NO: 490: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 564 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 490:



ACAACAATAT AGTAAAGCGT CGTTAATTAA ACAATGGGAC CAATTTGTTC GTCTTATATA 60 

AATGTACTTT aCCTTTATTT TTACAAAAAT AGCATTTTCC TATGTCATTT AACTAAACAT 120

GTAAGTTCGT ATGAACGAGG TTTGTTAAAT AGATGATTCT AGGAAAATGC TTTTTTCTTT 180

TGACTTAGTT TAAAATATTT TGCCACTTTG TACTGATAGT AGTTGCATTG TACTGTTGTG 240

CAGATTCTAT GCTATTAATT GAAAACTGTT GCAATTTTTG AGTATTATTT AGTAATTGGT 300

CGACCTTTTC AACCATTTGA TTGATGTCAC CTTGAGGTAC TAAATAGCCA TTAAATCCAT 360

CTTGaATCAG TTCTGaTGGA CCATAATCTA CATCATAACT GATCACTGGT GTACCTACTG 420

AAAgCGACTC TAAAATTGCT AAGCCAAAAC CTTCCATTTT ACTTGTCGAT AACATCAGTT 480

CTGCTTTAGC AATCTCTTCA TTAATATGCG TCTTAAAACC ATGAAATTTA ACATGTTCCA 540 

GATATnATGA TAATCTTCTA CAAG 564 



(2) INFORMATION FOR SEQ ID NO: 491: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1277 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



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<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 491: 





TATCCACCCC 


ACGAnAGCCC 


CGGAAACTTA 


TTGTGTTACA 


AGATATATAA 


opaoaaapra 


en 


5 


ACAACAGTTA 


ACAAAATAAA 


TGAAATTAAA 


CGTTTTAAAA ATGAAACAAA 


TfS A A ATP ATP 


Xav 




TATTAGGTTA 


TGAAACTGTT 


TATAGCTTGA 


ATAGAAGCAT 


TTATTTTTTA 




i fin 




TATTATGC6T 


CAAACATTTA 

WWV1W14 X X#% 


TGGCAAATGA 


ATCAAACATT 


GAGCGCAAAT 


GGTATGTTAT 




10 


CGATGCTGAA 


GGCCAAACAT 




ATCATCAGAA 


GTAGCATCTA 


T CTT ACG CGG 


300 




TAAAAATAAA 




pz PP & p & Pf2*P 


TGATACTGGT 


GATTATGTAA 


T CG TTATTAA 


360 


15 




ATCGAATTTA 


CTGGTAACAA 


AGAAACTGAC 


AAAGTTTACT 


ACCGTCACTC 


420 




w» IViO XATvA 


AAT CAATCAC 


TGCTGGTGAA 


TTAAGAAGAA 


CTAACCCAGA 


480 




AOGTTTAATT 


a* * k /wink * 

GAAAACTCAA 


TTAAAGGTAT 


GTTACCAAGC 


AP/rPGTTTAd 

X WW X X X<f%N7 


GCGAAAAACA 


540 


20 


AGGTAAAAAA 


TTATTTGTAT 


ATGGTGGOGC 


TGAACATCCA 


CACGCTGCAC 


AACAACCAGA 


600 


AAACTACGAA 


TTACGTGGTT 


AATTAGAAGG 


AGGAAATGA.C 


TTTGGPJlPJVA 


GTTGAATATA 


660 




GAGGCACAGG 


CCGTCGTAAA 


AACTCwGtAG 


CACGTGTACG 


xxx a\jin\v\.ei 


GGTGAAGGTA 


720 


ACATCACAGT TAATAACCGT GACGTACGCG AATACTTACC ATTCGAATCA TTAATTTTAG 780

ACTTAAACCA ACCATTTGAT GTAACTGAAa CTAAAGGTAa CTATGATGTT TTAGTTAACG 840

TTCATGGTGG TGnTTCACTG GACAAGCTCA AGCTATCCGT CACGGAATCG CTCGTGCATT 900

ATTAGAAGCA GATCCTGAAT ACAGAGGTTC TTTAAAACGC GCTGGATTAC TTACTCGTGA 960

CCCACGTATG AAAGAACATA AAAAACCAGG TCTTAAAGCA GCTCGTCGTT CACCTCAATT 1020

CTCAAAACGT TAATTGTCGG ACGATATATA CAAAACACCT CGATATTATG TCGAGGTGTT 1080

rivrvuGCGT TTTTGCGGCG AATATGGAAT GTGTAGAATA TAAATGAATT TTTACCTTCC 1140

CACCATAAAA GATGAAGAAC CATGAATGTG GAGAACAATA AATAGTTGGA TATTCTGTTA 1200

TTTTTTTGGA AGTGGAAGTG GATTTGGAAT ACTTTACTCn AAACGATTAA AAGGTTTAAA 1260

AAAACAACAA AnAGAAA 1277




(2) INFORMATION FOR SEQ ID NO: 492: 









(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 673 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 492:
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10 






TTATTGACAT TGTTTTTATC CAAAATTCAT TGTTAAQACA TTTTCTTTAT GAAATAATAA 120 

TnATATTGAA GTATATTTTT ATTATTATTA AAAATAAATA AGGGGATACT TATGAGCACA 180 

AATCAAACAT TTTTAATATT TGTTATAGCA ATTATTCTAC TTACATCTGT AATAGGAATT 240 

GTTGGAOGAT ACATGAGTCG TCAACGTCTA TTAAAATCTA TGGAAACATT ATGGCAAACG 300 

ATTTCTCCAT TAGAAGCTTT TATCAGACCG AACTCACATT TCGACTATGA GTATAAGCTC 360 

TAGAAGGAAA AATTTOAATC ACATTCATTA GTTGATGATA AAACTTGGTC CGACTTAAAT 420 

ATGAATGCAA TCTTTCATAA GATGAATTAT AATTTAACAG CTATTGGTGA AATGAAGCTA 480 

TATGCCTGTT TACGTGGAAT GCTTTCAATT ACGAACAAAT CATTACTTAG TTTATTTAAT 540 

GATAATGCTG AATTTAGAAA AAACGTAACA TATCATTTAG CTTTGATTGG tAAAACTGTT 600 

aTCCAACATT TCCAGACCAA ATCACACCGG kAAACGTCCA AATATATTGn TCTATGCCCG 660 

GTTTACCAGT ATC 673 
(2) INFORMATION FOR SEQ ID NO: 493:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1240 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493:

AAAAAAAGTT AATGCTGACG GTGTATTAAC TTTTGATATT CTAGAAAATA AATATACTTA 60 

TGAAGTTATT AACGCTATAG GGAAAAGATG GATTGTTAGT CATGTCGAAG GTGAAAACGA 120 

CAAGAAAGAA TATGTAATAA CTGTCATTGA TAGGAAATCA GAAGGCGACA GACAACTGGT 180

TGAATGTACT GCTAGAGAGA TTCCCATAGA CAAGTTAATG ATTGATAGAA TTTATGTTAA 240 

TGTAACAGGA TCTTTTACAG TAGAAAGATA TTTTAACATT GTGTTTCAAG GTACTGGAAT 300 

Gu-x-i-ri-rGAA GTCGAGGGCA AAGTTAAATC TTCAAAGTTT GAAAATGGTG GTGAAGGCGA 360 

TACAAGGTTA GAAATGTTTA AAAAGGGATT AGAACATTTC GGTTTAGAAT ATAAAATAAC 420 

GTATGACAAA AAGAAAGACA GATATAAGTT TGTATTGACG CCTTTTGCAA ATCAAAAAGC 480 

GTCTTATTTT ATTTCTGACG AAtCAACGCC AACGCTATAA AACTCGAGGA AGATGCAAGT 540 

GATTTCGCCA CCTTCATTAG AGGATATGGT AATTATTCAG GAGAaGAAAC ATTCGAACAC 600 

GCTGGGCTCG TAATGGAAGC TAGAAGTGCA TTAGCTGAAA TATACGGCGA CATCCACGCA 660 

GAACCATTTA AAGATGGTAA AGTGACTGAC CAAGAAACTA TGGATAAAGA ATTACAATCG 720 
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TATCCAGAAG CAGACCCACA ACCCGGAGAC ATAGTACAAA TAAAATCTAC CAAACTAGGT 640 

TTGAATGATT TAGTCCGTAT AGTACAAGTT AAAACGATTA GGGGTATAAA CAATGTAATT 900 

GTTAAGCAAG ATGTAACGCT TGGTGAGTTT AATCGAGAAC AACGATATAT GAAAAAAGTT 960

AATACTGCAG CTAACTATGT TTCTGGATTA AATGATGTTA ACCTTTCTAA TCCTAGTAAA 1020 

GCGGCAGAAA ACTTGAAGTC TAAAGTAGCG TCAATAGCTA AATCAACACT CGATTTGATG 1080 

AGTAGAACTG ATTTGATTGA AGATAAACAA CAGAAGGTAA GCTCTAAAAC TGTGACTACA 1140 

TCTGACGGCA CTATOGfTTCA TGATTTTATa GATaAATCmA ACATTAAaGA TGTAAAAmCG 1200 

aTTGGAACGa TTGGOGATtC TGTAGCTAGA GGATCACATG 1240 

(2) INFORMATION FOR SEQ ID NO: 494: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1311 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494:





ACGGTGGATT TAGCCTAGAA GATTTAACGC ATCAAGGTAa ATTaTCAGCA TTTAGCTTTA 60

ATGATCAAaC AGGTCAAGCA ACATTGATTA CTAATGAAGA TGAAAACTTC GTAAAAGATG 120

AGCAACGTGC TGGCGTAGAT GCAAATTATT ACGCTAAACA AACATATGAT TATTACAAAG 180

ACACATTTGG TCGTGAATCA TATGaCAACC AAGGTAGTCC AATTGTTTCA TTAACGCATG 240

TTAATAACTA CGGTGGTCAA GATAACAGAA ATAATGCCGC ATGGATCGGT GACAAAATGA 300

TCTATGGTGA TGGTGATGGT CGCACATTCA CAAGTTTATC GGGTGCAAAT GACGTAGTAG 360

CACACGAATT aACACACGGT GTGACACAAG AGACAGCGAA CTTAGAATAT AAGGACCAGT 420

CAGGCGCTCT AAATGAAAGC TTTTCAGATG TTTTTGGATA CTTTGTAGAT GACGAGGATT 480

TCTTAATGGG TGAAGATGTC TACACACCTG GAAAAGAGGG AGACGCTTTA CGCAGCATGT 540

CAAACCCAGA ACAATTTGGT CAACCAGCTC ATATGAAAGA CTATGTATTC ACTGAAAAAG 600

ATAATGGTGG CGTACATACG AATTCTGGAA TTCCAAATAA AGCAGCTTAT AACGTGATTC 660

AAGCAATAGG GAAATCTAAA TCAGAACAAA TTTACTACCG AGCATTAACG GAATACTTAA 720

CAAGTAATTC AAACTTCAAA GATTGTAAAG ATGCATTATA CCAAGCGGCT AAAGATTTAT 780

ATGACgAGCA AACAGCTGAA CAGGTGTATG AAGCATGGAA TGAAGTAGGC GTGGAGTAAA 840

AATATATAAA CaAGAAGAAG TAATGTTAAA CACTTATAAA TAATTAAATT TTAAATACAG 900






EP0 786 519 A2 





ATTAGATGAG AGGAGTGTGA GGGTTGTCTG CCGAAAOACT ACTCGGCAGT CTAAAATCAT 1020

TACAAGTAGT AGATATGTGA TAATTAAATG CTGACTTAGA ATACAAAATT CATTTTAAAA 1080

GTTGTCACAA AAAATTTACA TGTATTTTTA TTATCTTTTG CAAAACAAAG TGTTAAATTA 1140

TAAATGAaAC ATGCATGAAT TTATTTTTTA ATACAAGAAA CGTAACTACC AAAGGAGTTT 1200

ACAATATGAA GAAAAGTAAA CGATTAGAAA TTGTTTCTAC AATAGTTAAA AAGCATAAGA 1260

TTTATAAAAA AGAACAAATc ATTTCATATA TTGAAGAATA TTTTGGTGTA A 1311




(2) INFORMATION FOR SEQ ID NO: 495: 










(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1761 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 










(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 495:








TGCACTTTCT AAAAATACTT GCTTTACTTG TTCCAATTCC TTGTCAGACA TTGATTCGGC 60

TAATTTATTC ATTATTGCCT CAAATACGTT AATTATGTCA TCCATTTCAG TACTATAAAA 120

ATCAAACCAT TTTGCCGTAT CTTTTTCTCT ATTAAGCTTA TGATCACTTT GAGAGCGCTT 180

AGCTAATTCT GCATAAATAT ATGGACAAGG TGCCATTGCA GCAATTGTAT AAATAGCATT 240

TTCACGACTA TGCGCTTGGA AATACATATG TTTTATGTAA TGGTCGCCAC TTGGAGGCCA 300

AACTTTTGTT TTAATGATTT CTTCGTATGA TTCACCAACA ATTTGCGCTA AAATATCATG 360

CGCAAGTACT TCACCTTCAA CCATAAATTC TATTTGCTCT ACTAAAAATT TTACGTCATT 420

CATGCTATTC ATTTTTGGAA TTAACAAAGC ATATAAGTTT GTAAATTCTT TTAAATACGC 480

AGCATCAGCT TTTAAGTAAT GGCGTAATGC GTCAGcTCCT ATATCTCCGG ATAACATCTT 540

CTGAATAAAG TCATCCTCAT AAATATCATT AATGATTGGC TTTGCAGCTT GGTACAATTT 600

TTGTGAAAAT TCCATTGTAA AAAATCCTCC CTAAATAAAA AAACTACTTC CAACATGAAA 660

GTAGTTTGAT GGCAATGTTG CTATACTAGC CCCATCACTT CAATAACTAC TTTCCTACGT 720

TGGTACTAAC CAAATCAGGT CATAAGGGTC TGAACAATTC ATCTCAGCCA TATCATTAGG 780

CTCCCCTAGT AGTTCCTTAG TATTCAATTG CAAATTAATC TTAGCAAACG GTTTCAACAT 840

TTTCAATTAT TGTTGCTCAG TTGTATTATT ATCTTTAAAT AATAATTCTA TAATGACATA 900

TATTTGCGAA ATAAAAAAAC CGGAACATAT CGAGAATTCC CCGATATATT CCAATCTAAA 960

AGTTACTTAT ATAACTATTA ATTAGCTATG CATAAATGGC TTATGCAGTA ACCCAATGTC 1020
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20 






TTX3CTOC5TGA TACACCTTTA TATTTAGCAG OTGCTACTC3A ATCCCAAGTT GATTOTAAGA 1140 

ATTGATACTT ACCAGCTGCA CCTGgATGTT GGrTTTACAG CATGAATATT GCCACCTGaT 1200 

TCACGTTCrG CAATTTGTTT TAGATGAGCa TTCACATTTA CTGATGAACC TTCTGATGAT 1260 

TTTGATyCAG TTGGTGTTGC AGTAACTTGT GAATTGTTTG ATGTTGATGC TTGTCGTTGT 1320 

TGAGTTTGAG CATTTTGTGG TGCTTCAACT TCTTGTGATT GTACTTGATT AGCTTGAACA 1380 

GCTGATGGTG CAACATTATT AGTTGCAGGT GCTTGTGCAC TCATGTCTGC TCCATTAGTA 1440 

CCTGTTGCAT GGTAATTCCA AGCAAAGTGT GTACCATCTG ATTCAAAGTG ATAAGTAAAC 1500 

CCTTCATAGT CAAATGTATA ATTATAAGCC CCAGCTTCAA TTGGTTTTTG ATTTAATCTT 1560 

TGATCATTTG ATTGCGCCAT TTGCCTGAAA GATGCTTTAT TTAAGTCCGC TTCACnTGCA 1620 

TGGGCTTCGT GGACCTGCAT TTCCTGGCTA CGATTCCTAA ACCTACTGGC nAAnGATGAT 1680 

GCGAGTAATG TTTTCTTCAT AATCTTAAAA TCCTCCTACA AGTGAATTTG TGTCTCTAAA 1740 

AGTTTTACAG TGGACGACTG T 1761

(2) INFORMATION FOR SEQ ID NO: 496:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 794 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496:

TCATTTATGA AAAATGTCGG AnAGAnCCaa GaAAmaCAAT TGAgCGTGAA GAAAAAGCAA 60

GACTTAAAGA AGAACAAAAG GCACGTCAAA ATGAACAGCC ACAAATAAAA GATGTGAGTG 120

ATTTTACGGA AGTGCCTCAA GAAAGAGATA TTCCAATTTA TGGGCATACT GAAAATGAAA 180 

GTAAAAGCCA GAGTCAACCA AGTCGAAAAA AACGAGTGTT TGATGCAGAG AATAGTTCGA 240 

ATAACATCGT AAATCATCAT CAAGCAGATC AGCAAGAACA ATTAACAGAA CAAACTCATA 300 

ACAGTGTTGA AAGTGAAAAC ACTATTGAAG AAGCTGGTGA AGTTACGAAT GTATOGTATG 360 

TTGTTCCACC GTTAACTTTA CTTAATCAAC CTGCAAAACA AAAAGCAACA TCTAAAGCTG 420 

AAGTGCAACG TAAAGGACAA GTACTAGAGA ATACATTAAA AGATTTTGGG GTAAATGCAA 480 

AAGTGACACA AATTAAAATT GGTCCTGCAG TAACTCAATA TGAAATTCAA CCAGCTCAAG 540 

GGGTTAAAGT GAGTAAAATT GTAAACTTGC ATAATGATAT TGCATTAGCT TTAGCAGCAA 600 

AAGATGTTAG AATCGAAGCG CCAATACCTG GTCGTTCTGC AGTAGGTATT GAAGTGCCAA 660 
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ATAAACTAGA AGTTGGATTA GGaAGAGATA TATCAGGTGA TCCAATTACT OTTCCACTAA 780 
ATGAAATGCC ACAC 794 
(2) INFORMATION FOR SEQ ID NO: 497: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1161 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 497:



AGCCAGTTTT GcATTTCGTC AAATCGCAAT GAATATATTG ATTGCTTGTC AAAATTTGGA 60

AGAAATTTAT TCTTCTTTTT CTTTTCAGCA GTTATCATTC CTAATTGTCC TTTCTTTTTA 120

TCTTAGTGAT AAAGAATCCA TCTGAATTAA AGTCTTGCGG CATGATTTGT AACGTTTTGA 180

CCAACTCTCC AGTTATCGGA TGTTGAAACG GTTCAAATTC GAAGTTTTTA TTATTTTTCA 240

AAAACGTATA AATCACGTTT TCATTTTCTA GTTGCTCAAT TGTACATGTT GAATAGATGA 300

TTTCTCCACC TATTTTTACA TTGTTTTTTA CATTTTCCAA TATTTCAAGC TGTAATTCAA 360


CTAGTGACT C 


AATATGTTGT 




TATACTTAAT 




TflTfTnTTB 
A. \3 ± V— X VJil J. ./"V 


a *3 n 


CACCTAATCC GCTACATGGT GCATCAACAA GTATCTTATC GTATGTTTTA TCATAAGGTT 480

TTGTCGCATC ATGTTGAAAA GCTTTAATAT TTGTTAATCG TAATTnTTTT ATATTAAAAT 540

TAATTAAGTC TATTTTGTGA TCATGTATAT CTGAAGCGTC AACTTGCCCT TCTGGCATTA 600

AAACTTCAGC AATGTGACAA GCTTTACCGC CAGGTGCACT ACATGCATCT AATACGTGAT 660

CATGTCGGTC TACATTCATA ATGTGTGCAa CAAACATTGA GCTTTTATCT TGAATTGAAA 720

CGAATCCATC TTTAAATGAA CGAGAATGAA TAATTGGTTG TCCTCCTATA TGGAGACAAT 780

AAGGTAAGTC ATGATCTTTT TCAACGTCAT AACCTTCGTC TTGCAACTTT TCAATAATAT 840

CATCTAATGA TGCTCGCGTC AGGTTGGCAC GCACAGTTGT TGATGTCGTT TCTAAAAATG 900

ACTGTAAAAT TTTTTCAGTT TCTTCGAGAC CATAATGTGT TGCCCAATGA TCTATAATCC 960

ACTTCGGCAT ACTATACTCG ATTGCCATTC rrrrriTAGG ATCTGCAATT TCATTAAAAT 1020

CAGGTAAGTC ACTACGCATC ATTGTACGTA AAATACCATT TACGACATTA CCATTATGAT 1080

AGCCACCGCG TTCTTTTGCT ATTTCAACTG CTTCATTAAT AATGGCATGA TTTGGAACTT 1140

TATCTAnATA nACATATTGA T 1161

(2) INFORMATION FOR SEQ ID NO: 498: 
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(A) LENGTH: 1504 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 498:
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AGCTCACGTC ATCTTCGGCG 


GCGCTAAATT AAAATAATCA ATTTCTGAGT TAAACTTTTA 


60 




TTTACAACAT ACTATTACTA TACATTACAA ATTTTTAAAA TATACATTAC ACTCATTACT 120

GAATGGnAAG CGTATGATTT CmCAGCCCCC CTAGCTTGTA GAAATCATAC tttccttttt 180

TCAATATATA TACAACTATT AAATCCCATA AGATTGCAGA GCACATAAGT AAATTTTTTT 240

AGAGCTTGAG GTTTGTTTAG CTTAAGCAAC CCATGAGCTC AAACACTTCC TGTTCACATA 300

ACACTACAAA TCGCATTATG TTGCTTAATC TTATGTTTAT ATAAATTACA CACAATAAAT 360

AGAAAGAATG TGAACATCAT GAATAAATTA TTGCTACTCG TTACATTTAT CATTCGTGTG 420

GGTTCAGGTA TTGTTATGTT AATGCAAGGC TACGAAAAAT TAAOGOGCGG ATTTACGCTG 480

AAAGGTTTAG TACCAGTCAT CGCTAACAAT ACTGATTCAC CAGAGTGGTA TAAGTGGTTT 540

TTCGCAAATA TAGTTGCACA TACGACGTCA TTATTTGATA TTGTTGTCCC ACTCGGAGAG 600

ATTGCAATTG GATTAGGTTT AATTTTTGGA GTTTTTGCAT ATGCTGCTAG TTTCTTTGGA 660

GCCTTTGTTA TGATAAATTA TATCTTAGCA GATATGATAT TTACGTATCC TCTTCAATTA 720

ACTTTCTTTA TCCTTTTACT AATGAGTCAC TCATTGTTAA AACAGATTTC ACTTAAAGAA 780

ATCATTAATT ACTTTAGAGG TCGTAAGAAC AGAGGTGAAA AAATAGATGA CCCACTTACT 840

GATCGTGGAT GATGAACAAG ACATTGTAGA CATTTGTCAA ACCTATTTTG AATATGAAGG 900

TTACAAAGTA ACAACGACAA CTAGCGGTAA AGAAGCAATT TCTTTACTAT CAAATGATAT 960

TGATATCATG GTACTTGATA TCATGATGCC AGAAGTTAAT GGTTACGACA TTGTCAAAGA 1020

AATGAAAAGG CAAAAATTAG ATATCCCCTT TATCTATTTA ACTGCCAAAA CACAAGAACA 1080

TGATACCATT TACGCCTTAA CTTTAGGTGC AGATGACTAT GTCAAAAAAC CATTTAGTCC 1140

AAGGGAACTC GTTTTACGTA TTAATAATTT ACTTACAAGA ATGAAGAAAT ACCATCATCA 1200

ACCAGTTGAA CAACTGTCGT TTGATGAATT AACACTTATT AACTTAAGTA AAGTtGTGaC 1260

tGTAAaTGGT CACGAaGTCC CTATGCGTAT TAAGGAATTT GAGTTATTGT GGTATTTAGC 1320

TTCTAGAGAA AATGAAGTTA TTTCTAAATC AGAATTACTT GAAAAAGTTT GGGGATATGA 1380

CTATTACGAA GATGCTAATA CCGTGAATGT CCATATACAC CGTATTAGAG AnAAATTAGA 1440

AAAAGAGAGC TTTACAACAT ATACCATCAC AACTGTATGG GGATTAGGAT ATAAATnTGA 1500
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(2) INFORMATION FOR SEQ ID NO: 499:. 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1623 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 99: 

ATTGAAAGCG ATAATTCGTA nTAATTGAGT TTGTTGAAAA ATTTAGGGTA ATGTAAAGAT 60 

ATAAAAGATA CATAGAyTGG AGAGATATAA AGATGTTGAA TGAGATACAA ATATTAaATA 120 

aTGGATACCC GATGCCTTCA GTTGGGTTAG GTGTTTATAA AATCTCTGAC GAAGATATGA 180 

CTAAAGTTGT AAATGCTGCA ATTGACGCAG GCTATA&AGC GTTTGATACA GCATACTTTT 240 

ATGATAATGA GGCTTCACTA GGACGAGCAT TAAAGGATAA TGGCGTCGAT AGAGAAGATT 300 

TGTTTATAAC AACGAAGTTA TOGAATGACT ATCAAGGTTA TGAGAAAACA TTCGAATATT 360 

TCAACAAATC GATTGAAAAT TTACAAACTG ATTATCTTGA TTTATTTCTA ATACATTGGC 420 

CTTGTGAAGC AGATGGTCTA TTTTTAGAAA CATATAAAGC TATGGAAGAA CTTTACGAGC 480

AAGGTAAGGT AAAAGCAATA GGTGTATGTA ATTTTAATGT TCATCATCTA GAAAAATTAA 540

TGGCTCAATC AAGTATCAAA CCAATGGTGA ATCAAATTGA GGTACATCCA TATTTTAACC 600

AACAAGAATT ACAAGAATTT TGTGATCGTC ACGATATTAA AGTGACTGCA TGGATGCCTT 660 

TGATGAGAAA TAGAGGACTA CTAGACGACC CTGTCATTGT TAAAATTGCT GAAAAATATC 720 

ATAAAACACC AGCACAAGTT GTATTACGTT GGCATTTAGC ACACAATAGA ATTATTATTC 780

CAAAATCTCA GACACCTAAA CGCATTCAAG AAAATATAGA TATTTTAGAT TTTAATTTAG 840 

AATTAACAGA AGTAGCTGAA ATTGATGCTT TAAATAGAAA TGCAAGACAA GGTAAAAATC 900 

CAGATGATGT GAAAATTGGG GATTTAAAAT AACTGGATGT TAAATTTTAC GTTTATGAAT 960

GCCTTTTAAT GTGTACATTA AAATAAATGA GTTGGTTTTT ACTATTTGAT AAAACAATAC 1020 

TCAGGTACAT TCAAAATCTT TTAAATAAAA AGGATGGACA TAGATGAAAA TTAGAGTCGT 1080 

CATTCCTTGT TTTAATGAAG GGGAAGTCAT TACACAAACA CATCAACAAT TAACTGAAAT 1140

ACTTTCACAA GATAGTAGTG TGAAAGGCTA TGATTATAAT ATGCTTTTCA TAGATGATGG 1200 

TAGTACGGAT ACCACTaTAG ATGAAATGCA ACATCTTGCC ACAATAGATA GGCATGTCAG 1260 

CTTTATTTCT TTTAGTAGAA ATTTTGGAAA AGAAGCAGCT ATGATTGCAG GTTACCAGCA 1320

TAGTACTGAA TTTGATGCAG TCATCATGAT AGATTGTGAT TTGCAACATC CACCTGAATA 1380
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35 



TAGAAGTGGT GAAAATTTTA GTCGCAAAAC ATTAAGCCAT TTGTATTATA AGTTAOTTAA 1500
TTGCTTTGTA GAAGAAGTAC AATTTGATGA TGGTGTTGGT GATTTTAGAC TTTTAAGCCA 1560
AAGAGCTGTT AAATCCATTG CATCACTTGA AGAATATAAT CGnTTTTCAA AAnGGnTATT 1620
TGA 1623

(2) INFORMATION FOR SEQ ID NO: 500: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 605 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 500:

AAAGTnGGTG AAnCTATATA CTTAATCTAT ATTTATATAT TAACCATTAG GGTTAAAAAT 60

TACTCTAGCA TTTATGAATA GATGGGAGTT TATTTTATTA TTATATAGGA GAGATGTTGA 120 

ATGACACATC GCGCACTATT AGTTGTTGAC TATTCATATG ACTTTATCGC AGACGACGGC 180 

TTACTAACAT GOGGTAAACC TGGACAAXAT ATTGAAGATT TTATTGTTTC TCGTATCAAT 240 

GACTTTAATT ATTATCAAGA CCATATATTC TTTTTrrTGG ATTTACATTA TTTACATGAC 300 

ATTCATCATC CTGAAAGTAA ATTATTCCCA CCACACAATA TCGTAGATAC AAGTGGTAGA 360

GAATTATACG GTAAAGTAGG TAAATTATAC GAAACAATTA AAGCGCAACC TAATGTACAT 420 

TTCATTGATA AAACGCGCTA TGATTCGTTC TTTGGTACCC CGCTTGATAG TTTATTGAGa 480 

GAAAGAAGTA TTAATCAAGT CGAAATCGTT GGTGTATGTA CCGATATTTG CGTGTTACAT 540

ACAGCAATTT CTGCATACAA CTTAGGtTAT AAAATTTCAG TACCTGCTGA GGGAGTGGCT 600 
CATTT 605

(2) INFORMATION FOR SEQ ID NO: 501:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1739 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501:
TAGGTTnAAA GCATAGnTTT nTCAAAAAGA CAAATCATTC ATATATTGGA GGATATTTTG 60 
GTGTAAGATA TAGTGCAACC ACAATTGCTA AAGACTTGAA GGAACTAAAT ATATATCGTG 120
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TGAGAGAAAA 


GTTTAGACAC 


TATTGTGAAC 


ATGAAGTTCT 


AAGTTCAATC 


ATCAATGGTT 


240 




CATACATTAT CGTCAAAACC TCACCTGGTT TCGCCCAAGG CATAAACTAT TTTATCGATC 300

AGCTAAATAT AGAAGAGATA TTAGGTACGG TGAGTGGAAA TGACACTACA TTAATCTTAA 360

CTGCCTCAAA TGATATGGCA GAATACGTAT ATGCAAAATT ATTTAAATAG ACATGTATCA 420

AATGAATAAT AAAAATTTGT TTCGTATCAC GTGTACTCAA GTTAGTTACC AAATATTAAC 480

TTGTGTACGC GTTTTTTTAT GGAAAGAAAG AATTCATAGT CATTCAATTG ACTGTATAAA 540

AAACTTTATA CAACATGTTT TTATGGGTAT TTTTGAATAA AAAATGTATA TTTTGACCCA 600

AAATACCTTT ATTTATGTAT AAAAATCCAT TATTATGTAT TGTATAACAA AAAGATATGA 660

AATTTTCGAC TTTCTTTATG TGAATATAAT CACATGTAAG CGTTTGAAGA TTGTCTATAC 720

TCTAAATGAA TTCAAAGATA AAAGGAGGAA ATAGACATGA CAGATGGTCC AATTAAAGTA 780

AATAGC6AAA TTGGAGCTTT AAAAACTGTG TTACTTAAGC GTCCTGGaAA AGAATTAGAA 840

AATTTAGTAC CTGATTATTT AGATGGATTA CTATTTGATG ATATTCCATA TTTAGAAGTA 900

GCTCAAAAAG AGCATGACCA TTTTGCGCAG GTGCTAAGAG AAGAGGGTGT TGAAGTACTT 960

TACCTTGAGA AGTTAGCAGC TGAAAGTATT GAAAATCCTC AAGTAAGAAG TGAATTTATT 1020

GATGATGTAT TAGCAGAGTC TAAAAAAACA ATATTAGGTC ATGAAGAAGA AATTAAGGCA 1080

TTATTTGCGA CACTTTCTAA TCAAGAACTT GTAGATAAAA TAATGTCAGG GGTACGTAAG 1140

GAAGAAATTA ATCCGAAATG TACACATCTA GTAGAGTATA TGGATGATAA GTATCCATTC 1200

TATTTAGATC CAATGCCAAA CGTTTATTTT ACTAGAGATC CACAAGCCTC AATAGGACAC 1260

GGTATAACAA TCAATCGGAT GTTCTGGAGA GCACGACGAC GAGAATCAAT ATTTATTCAA 1320

TATATTGTAA AGCATCATCC TAGATTTAAA GATGCGAATA TTCCAATCTG GTTAGATCGA 1380

GATTGCCCAT TCAATATTGA AGGCGGCGAT GAACTTGTTT TATCTAAAGA TGTCTTGGCT 1440

ATAGGCGTTT CAGAACGTAC ATCTGCACAA GCTATTGAAA AGTTAGCGCG ACGTATTTTT 1500

GAAAATCCGC AGGCGACGTT TAAAAAAGTA GTAGCAATTG AAATTCCAAC TAGTCGAACT 1560

TTTATGCACT TAGATACAGT ATTTACAATG ATAGATTATG ACAAATTTAC AATGCATTCA 1620

GCCATTTTAA AGGCAGAAGG CaATATGAAT ATATTTATTA TTGAATATGA TGACGTAAAT 1680

AAAGATATTG CCATCAAACA ATCTAGTCnT TTAAAAGATA CTTTAGAAGA CGTACTAGG 1739




(2) INFORMATION FOR SEQ ID NO: 502: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1745 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 








5 


CTGTACATAC 


AGCAATATCG 


TTAACAAATG AAAACAGTAT 


A 1 lAUWil XVj 


1AAACCATGA 


60 




TAAACCTAAA ATACTGTTAT 


TTTTATTACT 


TAAATTTCTT 


C A x LAATCCC 


TTTTCAACAT 


120 


10 


AAGGTGGAAC 


GAATTCAGAA 


ATATCTGCTC 


GATAAGCTGC 


AACTTCTTTA 


ACAATACTTG 


180 


AACTTATAAA 


TGAATAATTA 


GTACTAGACA 


TCATATATAA 


CGTTTCAATT 


TCATTGTTCA 


240 




AcrrrrrATT 


CATTGAAGTT 


AAGCGTAATT 


CATATTCAAA 


ATCACTGACT 


GCTCTTAAAC 


300 


IS 


CACGTATGAT 


TGTTTTAGCT 


CCTACTTGTT 


CACAATAATP 


GACTAGTAAA 


CCACTAAATT 


360 




GATGAACCTT 


GACATTAGGT 


AAATGTTTAA 


^•nUnX X\JX Iv 


AATTAAATCC 


ATACGCTCTT 


420 




CTAAACTAAA 


VW X^k A X V* X 


TTTTTAfTlT 
x x x tnv> inl 


XXX lAAUnnL 


ACAGACATGA ATTTCATCAA 


480 


20 


^ X c^X X x7 X 




ATAATGT P/T A 

«% XnnX w X x/\ 


A A*TY2 A PP ft *T A 
nn X unLLn 1 n 


AGTAATGGGG 


TCAAAACTAC 


540 




CCGGAATfiAr 1 






TATTCTCCCT 


TTTCTAATAA 


CAATGTGTCT 


600 






nn X VJ\? Xnn\_v? 


TTT A LTf A T A 


x inAAUiblT 


GATAATCtAT 


TTCTTCATGA 


660 


25 


TTGCTAAATT 

* * VJ*w-X X A 


*t«nV^>nn\«V9n X 




TCTTTCAATA 


AATTAAACTC 


TGAAATTAGT 


720 




TTTAAAGCTT 


T ATP A ATYS Af2 
X n X V— nn X van \9 


AVW X X XnX Xn 


T A A/^rtTVST2 ft T 


CTAaGAAAAT 


GACATCAAAT 


780 


30 


TGAATATCA C 


VJ A X X A uf\VJV\ 


x vjv.. x x x innn 


w^r X l» In X V- 1 


CATTATTTTT 


ATAAACTTCA 


840 


GATTGTGCCT 


CTAAATCCAA 




x x A Vjd X x Xnn 


TAACTTTTAC 


AGCTTTAAAA 


900 




TTTTGaTCAA 


CAAAGATTAC 


CTTATCCATA 




GTGcTTCTAT 


TCCAAGCGCC 


960 


35 


CCGCTTCCTG 


CAAATAAATC 


TAAACCTATA 


CCTGACACAT 


CATATAAACT 


ATTAAAGATA 


1020 


CCTTCTTTAA 


CTTTATCCAT 


AGTTGGTCTC 


GTATTACGGC 


CTTCCATACT 


TTCTAAAGCT 


1080 




A A X X -1 A X 




AATGACGCGC 


ATGTTGTTCA 


CACTTCCAAT 


TCATTTAGTT 


1140 


40 


ATTTAATATA 


ATTTATTGAG 




A I viAT AAAC C 


AATGAAACAA 


ACATTTATTA 


1200 




CACTTGGTGA 


AGGTCTAACA 


GATTTGTTCG 


AATTCATGAC 


GATGATTGAA 


TATAACCATC 


1260 




AACGTATTGA 


TAAAATTATC 


TATTTTCATT 


CACCACAAGC 


TGAAAATAAA 


AAGTCATCTG 


1320 


45 


TAGCAATCAT 


TATGAACCCT 


ACAACTGGCA 


ATCATTTCCA 


AGCATTTTAT 


ATCATGATAA 


1380 




ACGCTATTAA 


ATATCCATAT 


CCAGATTCAA 


ATAAAAAGTT 


TCAAATGATA AATGATTGTG 


1440 




CTGAAAAATT 


CGACATACCA 


ATTTTAGGTA 


TCGATGTACA 


GCCCCCTCAA 


GCATTTCATG 


1500 


SO 


ATTTATCGTT 


ATATTATAAT 


TATTTAATTA 


GTGTGTTAAG 


GCTCCAAAAA 


TGGATACCAG 


1560 




aACTTCAATA 


ATATTAATTA 


TATATTTCGT 


GTTTCTCTTT 


TTCGTAAGTT 


TTCTTTAAGT 


1620 
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TTTGATCAAC ATCTTGCTCA TTCACATACA TAATTACAAA TTTACGATCT CTATTTQAAT 174 0 



GAACG 1745

(2) INFORMATION FOR SEQ ID NO: 503:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1035 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503:





TCGTCTTTAA 


TCTTGCTGAC 


TTTAGAAGGC 


TTACGAGTGC 




'i-i-i-i^ritT/ 

AAA XUwiiV JL 






TTAACTGATT 


GAACTAAAGC 


TtGACGTGTA 


GATTTATCA(3 


CTAAACTAAT 


Tf? app app a 


ton 


20 


ATTACGGCAC 


CAATTAAAAT 


ACCAGGAACA 


AATTTATTTT 


CCATAAAAAA 




i fin 




TTCAAATTTG 


CATcrxM-xAC 


GATGTAGTCT 


ATTAAATTAT 


CACAAGATGA 


TAATACCATG 


240 




TCGTATACAC CTTCAAAATT ATTCGTGTAG TATGGATCTG GTACATCACT CTCTTCCATA 300

TTACTAAATT CTAACAGTTT GAACAATTGT CCCTTAAGAT TAGGATTGAT AGATTTAATA 360

TTATCAACGT TACTTTGATC CATAGCCACA ATGTAATCAA AATCATCTGT CGCTTCGAAT 420

AATTCACTAA TCATGCCATC AAATGGAATA TTGTGTTTGT TGAGAATTTT TTGTGTACCT 480

TCATGAGGTG GCTCTCCTAA ATTCCAGCTA CCAGTACCTC TTGAATGTAC TTTAATATCA 540

TGAATATTTC TGTCTTTAAG TCTTTGTCGC ATGATTGCTT CTGCCATTGG AGAACGACAT 600

ATATTGCCAA GACAGACAAA TGCTACATCT ACCATTTTGA TTCCTCCAAA CTATGTAGTT 660

ATATCCCCAT TTTATAGCGA CTTTAAACAA TAAGAAAGCA GATTATATAA AATTCTATTA 720

AAGTTTATTA AATTGTGATA CTTTGATAAC ATAACTATTA TTAGAGGTGA ACATTGTGGC 780

TATGACAAAT GAAGAGAAAG TnTTAGCTAT TAGAGAGAAG TTAAATATtG TTAATCAAGG 840

ATTATTAGAT CCTOAAAAAT ATAAAAATGC AAATGaAGAA GAATTAACAG ATATATATGA 900

TTTTGkTCaA yCAAGAGaAA GATTGTCGCC AAGTGaAGTG mCAGCTATTG CTGaCGCTTT 960

AGGACAATTG CGACACGAAT AGGAGTGGGA ATTTTGACTA ATTACAAAGA AAAGTTACAA 1020

CAATACGCTG AACTA 1035

(2) INFORMATION FOR SEQ ID NO: 504:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1284 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504: 



AnCCTGACAA GATTCTTATA TTCATAGATA TGAGAGCTAA AAATGAAATC AACAAACAnA 60

CAATAAAGTA AACGATAATA GCCCATATAC CATTTTGTAA CCATATTACA AATTGTGTTG 120

TATTATAGCC ATTTCCAGCT AATAATTGCT GGATAAATGC ATTATTGTTT AATGTATTTT 180

CAAGATTAGC AAtCGATGTG TTATTACTGA ATGAAACAAG TGCTATAAAC ATCGTAATGA 240

CAGTAAGCAC TAATAACATC ACCCAACATA ACCAACCTAG AACTTTTTCA GTTAATCTAC 300

TTACTGGACG TTTAATTTGA GTAAATTGTT CTCCAGTCAT TCGTTACAAC TCCTTATAGT 360

ACTTATCCCG TTATTATAAC TAAATATACA GTAAATAACT ACTATTTATG ATTTTATTTT 420

AATOACATTT TGAAATTCAA AAAGTTTTCA TTGTATTCAC TTAAAACTTC AGGTCCTAAA 480

TCTTTATAAA CTTCAAGGCG TTCTTGCTCT TTCTTAGTCG GATAAAAACG ATGGTCGTCT 540

TTAATCTCTT TAGGCAACAA TTGTCGAGCA GCCTTGTITG GCGTTGCATA GCCTACGAAT 600

TCTGTATTTT GCTTGTTATT TTTAGCATCT AATAAAAAAT TCATAAATTT ATATGCACCC 660

TCTTTATTTT GTGCCGTTTT TGGAATTACC ATATTGTCGA ACCATAAATT CGATCCTTCT 720

TTAGGAATAA CATAATTATA TTTATCCCCT TCTTGCACTA GAGGTGCTGC AACACCACTC 780

CAAACAACCG CTATGTTACC TTCATTTTGT TGAAGCATCA TGGTAATTTC ATCACCTACG 840

ACACCTCTTA CTTGTGGTGC TAGTTTGGTT AAATCTCGCT CTGCTTCTTT TAAATGGTGC 900

GAATTACGGT CATTAAGATT ATACCCAAGT TTATTCAAAC TCATGCCTAT AATCTCTCTA 960

GCACCGTCAA CTAGTAAAAT TTGGTTTTTA AATTTAGGAT TATACAATGA CTTCCAACTA 1020

TCAAATGATT CATTTGGATA CTTTTCTTTA TTATATAAAA TACCTACAGT TCCAAAGAAA 1080

TAAGGTAAAG AATATTTATT GCCTCTATCA AATGACATAT TCATATAATC TGAATCTAAA 1140

TTTTTAATAT TAGGTACCTT ATTATGATCT ATTGGTAACA ATAAATGATC TCTTTTCAAT 1200

TTTTGAACTG nATATTCACT AGGAnAAGCA ACATCATAAT GTGTACCGCC AGTGCGAATT 1260

TTGGnGTCCA nCGCTTCATT TGAA 1284


(2) INFORMATION FOR SEQ ID NO: 505: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5763 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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ATAATTAACA TCTTTTGCTA TATACCACCA OTTTGATACA TAAAATATCG CAGCAATAAT 60 

ATCATGTTTA ACCCTAATGA TATTATCTGA TTTTAATAAT AAGGTTGCTG TCCCTACAAC 120 


CATTAATAAA ACTATGACTG CTGGTAATAA ACGTTTTAAA CGACGTATCC AAAAGCTTTT 180 

CAATTTAATG ATACCTGTGT CATCATACTC TTTGAGTAAT AAGCTTGTAA TTAAATAACC 240 

AGAGATCACA AAAAATGTAT CCACACCTAA AAAGCCACCT GtCAACCATT GCTTATTTAA 300 

GTGGTAAATA ATAATTCCTA GAACAGCGaT TGCCCTCAAA CCATCGAGCC CTGGAATATA 360

TCTCATTTTC TTATACTTTG TAAAACCCTT TGTTTTGTTC ATTTTTGCAT TCTTCCCTTT 420

TAAAACTGTT CTTCTTAGAT GCTTAATTAA ATTTAGTTAT GCTGTTTAAA AGAATATTGA 480

AATCCATATG TATATTATTG AATTACGACA TCATCAAAAT CATATTGACT AAAATACTGT 540

TAAATTAAAA AAATTACCAA TGATGATTCT TACTTCGAAA TCCAATTTGT AATGCAACTC 600

GGCAACTTAA AAACTATGAA GTATTATGTA TTGTAATATA ACTGTAATAT AAATTCAATT 660

TATTATAAAA ATTTTCAAGA AAATATTCAA CTAGAAAATG AATTGTGCAC TCTTGGAAGT 720 

GCAAGTCACT GTCTTAATTC ATATTTTTTG AAACAAGTTA GATATAAATT TTCAAAATAA 780 


AATCAGAAAC TAGAACATAA ATAAGGCTCC CTTCAAAATT TTCATTTTTC AATGTCTACT 840 

TTGAAGGGAG CTTATTCACA ATGAATTATA CTCTACAATG TTATATTGAC TGCGGGCCCA 900 

AACACAGAGA ATTTCGAAAA GAAATTCTAC AGGCAATGCA AGTTTATGTT AGCTCACACC 960 

AAGTGCAATC TTAGCGTAAC GTGACATCAT ATCTTTTGTC CAAGGTGGAC TCCATACGAT 1020

ATTCACTTCA GTATCCTGAA TTTCAGGAAT CTCTGCTAAT ACTGTTTTAA CTTGaTCAAT 1080

AATTTGAGGT CCCATTGGAC ATCCCATTGA TGTTAAAGTC ATATCAACTG TACATACGCC 1140

TTCATGATCA ACATTCACTT TGTATACTAA ACCCAAATTA ACGATATCAA TTCCTAATTC 1200

AGGGTCAATT ACCATTTCTA ATGCACCTAA GATACTATCT TTCAATGCCT CTTCCATCCA 1260

TATCACCTCT TTAATGTCAT ATTATTCATA ATATATCAAA TATCOGACAA AACGCCAATA 1320

AAATGCTATG ATGTATCTAT ATGAACTAAG CAACTTATGA GGAGAGAGAT ATGCAACCAC 1380

ATTTAATATG TCTAGACTTA GACGGAACAT TATTAAACGA TAACAAAGAA ATTTCATCAT 1440

ATACTAAACA AGTATTAAAT GAATTACAAC AACGTGGaCA CCAAATTATG ATTGCGACTG 1500

GCAGACCTTA TCGTGCAAGT CAAATGTATT ATCATGAATT AAATTTAACG ACACCAATTG 1560 

TTAATTTTAA TGGCGCTTAC GTACATCACC CTAAAGATAA AAACTTCAAA ACTTGCCATG 1620 


AAATTTTAGA TTTAGGGATC GCACAAAACA TTATTCAAGG ATTACAACAA TATCAAGTAT 1680 

CGAATATTAT AGCAGAAGTG AAAGATTATG TTTTCATTAA CAATCATGAT CCAAGATTAT 1740 
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AAGAATCCCC TACCTCAATT TTAATTOAAG CCGAAGAAAG TAAAATACCT GAAATCAAAA 1860 

ATATGCTTAC TCATTTTTAT GCCGATCATA TTGAGCATCG ACGCTGGGGC GCACCATTCC 1920 


CTGTCATTGA AATTGTAAAA CTTGGTATTA ATAAAGCAAG AGGCATTGAG CAAGTTAOAC 1980 

AATTTTTAAA TATTGACCGA AATAATATTA TTGCATTCGG TGATGAAGAT AATGATATTG 2040 

AAATGATTGA GTACGCGCGT CACGGTGTTG CTATGGAAAA TGGTTTGCAA GAACTTAAAG 2100 

ATGTAGCGAA CAATATTACA TTCAACAATA ATGAAGATGG CATTGGTCGA TATTTGAATG 2160

ATTTCTTTAA TTTAAATATT AGATATTACT GTTAATTTAT AACTAATCAT TTTATAATAT 2220

TTTAAAACAA TAGGAGGTAA GTTACGATGC CCAAAATAGT CGTAGTCGGA GCAGTCGCTG 2280

GCGGTGCAAC ATGTGCCAGC CAAATTCGAC GTTTAGATAA AGAAAGTGAC ATTATTATTT 2340

TTGAAAAAGA TCGTGATATG AGCTTTGCTA ATTGTGCATT GCCTTATGTC ATTGGCGAaG 2400

TTGTTGAAGA TAGAAGATAT GCTTTAGCGT ATACACCTGA AAAATTTTAT GATAGAAAGC 2460

AAATTACAGT AAAAACTTAT CATGAAGTTA TTGCAATCAA TGATGAAAGA CAAACTGTAT 2520 

CTGTATTAAA TAGAAAGACA AACGAACAAT TTGAAGAATC TTACGATAAA CTCATTTTAA 2580 

GCCCTGGTGC AAGTGCAAAT AGCCTTGGCT TTGAAAGTGA TATTACATTT ACACTTAGAA 2640

ATTTAGAAGA CACTGATGCT ATCGATCAAT TCATCAAAGC AAATCAAGTT GATAAAGTAT 2700 

TGGTTGTAGG TGCAGGTTAT GTTTCATTAG AAGTTCTTGA AAATCTTTAT GAACGTGGTT 2760 

TACACCCTAC TTTAATTCAT CGATCTGATA AGATAAATAA ATTAATGGAT GCCGACATGA 2820

ATCAACCTAT ACTTGATGAA TTAGATAAGC GGGAGATTCC ATACCGTTTA AATGAGGAAA 2880

TTAATGCTAT CAATGGAAAT GAAATTACAT TTAAATCAGG AAAAGTTGAA CATTACGATA 2940

TGATTATTGA AGGTGTCGGT ACTCACCCCA ATTCAAAATT TATCGAAAGT TCAAATATCA 3000

AACTTGATCG AAAAGGTTTC ATACCGGTAA ACGATAAATT TGAAACAAAT GTTCCAAACA 3060

TTTATGCAAT AGGCGATATT GCAACATCAC ATTATCGACA TGTCGATCTA CCGGCTAGTG 3120

TTCCTTTAGC TTGGGGCGCT CACCGTGCAG CAAGTATTGT TGCCGAACAA ATTGCTGGAA 3180

ATGACACTAT TGAATTCAAA GGCTTCTTAG GCAACAATAT TGTGAAGTTC TTTGATTATA 3240

CATTTGCGAG TGTCGGCGTT AAACCAAACG AACTAAAGCA ATTTGACTAT AAAATGGTAG 3300

AAGTCACTCA AGGTGCACAC GCGAATTATT ACCCAGGAAA TTCCCCTTTA CACTTAAGAG 3360 

TATATTATGA CACTTCAAAC CGTCAGATTT TAAGAGCAGC TGCAGTAGGA AAAGAAGGTG 3420 

CAGATAAACG TATTGATGTA CTATCGATGG CAATGATGAA CCAGCTAACT GTAGATGAGT 3480

TAACTGAGTT TGAAGTGGCT TATGCACCAC CATATAGCCA CCCTAAAGAT TTAATCAATA 3540
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10 






GTTAGAATTA 
GAATCAAAAA 
TATGTCTTAT 
CACCTAATAC 
GTTCATGTCT 
ACAGCATTTT 
ATATCCAAAA 
GTTTGAAAAA 
AAATAATCGC 
TAATTATTTA 
AATCATGATT 
AACATGACAT 
TCTTAATATC 
AACACCTTTT 
TTAAATGATA 
TTAACTTCAT 
GGTGTACCAG 
ATTAATTCAT 
TTAACTTTTG 
TAAGGACCCA 
CTTTGTGCTT 
GCTAATGCTT 
TAATCTAATT 
GATGACGCTG 
TGATCTTTGT 
AATTCAATGT 
ACAACTGCTT 
AACAATGTTT 
CATACAGCAT 



TGTTGGACTG 
AGACATAAGG 
CAATTAGCCA 
CATTGTGATO 
CTTTCTTTTA 
CAATGTAAGC 
TCCTGAAATT 
TGGTGATCTG 
TAATACCCAA 
TATTATTAAA 
TGATTATACA 
TTTATTCCTT 
AATATTTCAC 
TAGAAAAGCA 
ATTATTTAGC 
CACCTGGTTG 
TTG CAATAAT 
CAATTTTAAG 
TAACAATATT 
TTGGGCAACC 
TGCGATCAGT 
TTGGAATCTT 
GATCAGTAAT 
CTTTTGTAAA 
AATTTCTACC 
CATTAAATGA 
TACGTACTTG 
TAGGATGGAA 
CTTCGCGTTT 



GTACTACTAT 
TCTTGGACTA 
ATACCGAATA 
ATAATTAATG 
GCAATCGACA 
AACATATGAT 
AACGTCAACA 
CCTTGATTTT 
CTTAATATAT 
TTAAAGCTTC 
CGAATATATA 
ATTAGAATAT 
TTTCATCTCA 
AGAAAAAACT 
GATATAAGTT 
TAAAAATTTA 
ATCTCCCGGA 
AATCATTTCG 
TACATTTTCA 
GCCAGTTAAA 
GATATCGTTA 
TTCAC CAG AC 
ATCTTTATGA 
TACATATAAT 
AAAAGCAATC 
AATTTTATAG 
TTCTTGAAAA 
ATCTCCTTCT 
TACTTTAACG 



CCAGTCCATT 
ATAATTGTCC 
ATTTTGATAT 
CAATTGTTAT 
CTTCCATCAA 
TTGCCCCGCC 
GCATAAATAA 
TTGAAATGTT 
GTAAATGTAA 
TTGGGATTAA 
TTCTACCACA 
TGTGAATCTG 
AAATGGTAAC 
AACCAATCTA 
GTCAGCGTTC 
GGTGGTTGCA 
TGTAGTGCAA 
CCAGTGTTAC 
GGTAATGGTA 
CTTTTTGATA 
ATAATTGTAT 
TTACCAATAA 
TTTGGAATTG 
TTTTCCACTT 
ACATTATTCG 
TCTTCAGCTT 
TCTAAAGTAT 
GCAAAGTCAG 
CCATATGAAG 



TTTTATGTTT 
ATGCCTTATG 
AGGSCCTAAC 
CCAAAACATT 
TCCAACTACT 
ATTCATAAAT 
GCGTAAAATC 
TAAGTATGTA 
CATACTGATT 
TACCCACTTG 
CTTCTATATT 
CTGTAAAATA 
ATTATAAATA 
TATAAAGACT 
CAATATTATC 
TACCTGCACC 
CATATTTTGA 
CATCTTGTCT 
GTTCGTCTTT 
AAAATGCTTG 
AG CCGTAAAC 
CAATACCTAA 
TTGCATTATC 
CATGATTTAA 
GAGGTGTTAC 
TGCCGCTATC 
GATTTTGTTG 
CAAATACTTG 
TTTTGTCATT 



AACATTTTTA 

TCATATACTA 
GGTAGAATGA 
GTGTGACTTT 
GCAACACCAC 
GACTGAATTA 
ATGTGCAACG 
GCGATAAATA 
CCCCCCACTT 
CTTGTAATTT 
TGAGAGGAAG 
ATCAACTACT 
ATTTATCTTT 
GGTTAGCTTT 
AATAGTCACT 
AACGCCTGCT 
AATTTCTTCT 
AATTTCATTG 
AGTAACGATA 
ATCTTGTTCA 
ATAATCTAAA 
TTCACCTTCA 
TCCTGTTAAA 
TTCGTTCGCA 
TGGTGGTAAA 
TTCTGCTGCT 
TAAACCAGCT 
TGTTAAATCC 
ATACTTGAAT 



3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 
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TTOGTTATCA AATAACAAAT AAATAAGTAA GACAATTTTG AAAATGAGTT GTGTTCATTC 
TGCTACAAGG ACTTTGCACT TAATCGAAAT TATTTTTTAT TCTTTTGAAA ATCAAAATAC 
TATAGTTGCA ATGTACCAAA TTTGAAGAAG TATAAATAAC CTTTAACTTC TTTATTAAGA 
ATCGTTTGAA GOGTATTTTG ATAATATTTC ATCTGTATCT TATATTTATT TTTTAATTGT 
GTACCAATTT CTTCATCTGT CATCCCACGG CGACGATTAA ATGCATCGGT TTTATAGTCT 
ACAAAATAAT GCACACCATC TTTAACAAAG ATTAAGTCAA TCATACCTTG AATAATTGAG 
ACG 

(2) INFORMATION FOR SEQ ID NO: 506: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 422 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506:
CCATATGATT TTGTGCAATA ACTCTTTTTC TTTCTTCTTT TCGTAAAAAG TTGTACATCG 
CTTTTGTGTT TAAGAGACTA TTGTTTTCTT TAGGTTTTTG AACTTCACTC AGTGTATTTC 
TAGTTGTTAA CACTAAAATT CCAACTGTTG TATCTTTGTA TCTAGCCATA ACTTTATTCA 
GATGTTTGTC ATTTGTAATA ACTACGACAT AATTAAACAC TTCATAATAA TCATTAATTT 
GATTATCTAA TCTATCCAGC TTATCTAATT CTGTTTTAAT CTCATAGACA ACGCCTTTGC
CGTTTAACAA TATAAAATCA GCaATACTTT TCCCTATGGG CATCTCAGAA AGTGCAGTAG 
TTGTATTAAT AGAATGTCGT CCTAGAAGGA GTkTATTAAG TATnGTGTTT TTGTAAAAAT 
AT 

(2) INFORMATION FOR SEQ ID NO: 507:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1188 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507:
GCTTATGTAT TCTCAAAATA TTTATGTGAT ACGCAAAGGA GACATGGTTA TTCGACCAGC 
ATTTGATGAT GACGATCAAA GAAACGGTAG TGAAATAATT CGGTTTGACA AAACGCGTAT 
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TTATCTTGGC 


AAGAAAGCAG 


AGAGAAACCG 


CATTACTGGC 


ATTTCTAGTA 


AACCACCTAT 


TTTACTAACA 


CCATTATTTC 


CAACTTATTT 


TTTCCCAACA 


CATTCTGACA GACAAAATGA 


AAATATTTGG 


TTAAATATGC 


ATTATATCGA 


AAGTATTAAA 


GAATTAAAAA ATCGTAAATG 


TAAAGTGACA 


TTTATTAATA 


ATGAATCAAT 


CATTCTTCAT 


GTTTCATACC 


ACAGTTTATG 


GCATCAATAT 


AACAATTCCA 


TTTTTTACTA 


TTACATGGTA 


GATAAACAAT 


CTCGCATGAT 


ATCAAAAAAT 


CCCGACCAAC 


CAATAGATTA 


TAATAAAGCC 


ACATTGAATG 


TGTTTGAAGC 


ATTGACACGC 


TATTCTTTAT 


TTGAAGATAA 


ATAAATTGTT 


TATTTTTAAA 


ATATGCGGAA 


TGTTTTATAA 


ATATAGTGTA 


AATGTTCTGC 


ATATrrrrrr 


AAGGTATCTA 


TTGCAAATTA 


ACTTAATCTT 


GTTATAATAA 


TATTTGTGCT 


TGaTATTCAA 


ACACATACAA 


ATTAATCCAC 


AGTAGCTCAG 


TGGTAGAGCT 


ATCGGCTGTT 


AACCGATCGG 


TCGTAGGTTC 


GAGTCCTACC 


TOTGGAGCCA 


TTGGAAACGT 


ACTCAAGTTG 


GCTGAAGAGG 


CGCCCCTGCT 


AAGGtGTAGG 


TCGCGAAAgG 


CGCGAGGGTT 


CGAATCCCTC 


CGTTTCCGTT 


ACTTGCTAAA 


ATGGTATATA 


CCATTTTAnC 


TTTTTTGTTT 


ACTTATATAT 


AATGAATGAG 


AATTTCACTG 


TTCTTTTATA 


TCAATTTTAA 


AATTCTAAAA 


ACCTTTCCTA 


GATAATCTTC 


TCTAAGAAAG 


GTTTTTATAC 


TTGTTGAACT 


TATAATTAAT 


TTATTACATA 


GCAATATTTA 


CCTGTTTTTA 


ACTATAAAAT 


TATCACTACA 


TGAAATACGA 


TAATTCGGaT 


CTCTTAACTT 


CTCTGCaATT 


AATGtACTCA 


TTGgTTTCAT 


CGTATGATTC 


ATGTATAATA 


GCATTTkTTA 


AATAATTC 





(2) INFORMATION FOR SEQ ID NO: 508:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 840 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508:
CCCAACTTCG GTTATAAGAT CCCTCAAAGA TGATGAGGTT AATAGGTTCG AGGTGGAAGC 
ATGGTGACAT GTGGAGCTGA CGAATACTAA TCGATCGAAG ACTTAATCAA AATAAATGTT 
TTGCGAACAA AAtCACTTTT ACTTACTATC TAGTTTTGAA TGTATAAATT ACATTCATAT 
GTCTGGTGAC TATAGCAAGG AGGTCACACC TGTTCCCATG CCGAACACAG AAGTTAAGCT 
CCTTAGCGTC GATGGTAGTC GAACTTACGT TCCGCTAGAG TAGAAOGTTG CCAGGCATAA 
TATTAATCCA CAGTAGCTCA GTGGTAGAGC TATCGGCTGT TAACCGATCG GTCGTAGGTT
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GGCOGTAACA CGGGTTCGAG TCCCGTAGgA GTCATACAAG CAGAAGTGAA ATATCGCTTC 480 

TGTTTTTTTA TTACATATTT ATTGTTGAGG AAGGTTGTCC GAGCTGGCCG AAGaGCACGC 540 

CTGGaAAGTG TGTAGGCGTC ACAAGCGTCT CAAGGGTTCG AATCCCTTAC CTTCCGTAAA 600

GGcGCTTAAA TTGGTTTTAC CCATTTTAAG CGCTATTTTT ATTTTGGACT CAATCCCTTG 660

ATATATCTGC ATTTGAGCTA TTATCCTCAT TTTTACACTT CTTATTTATT TATATCCATT 720

TAAAATTTTT TAGCCACAAT GTGACTAATT TTTGaTGAAT AATCCTAATT TTAGtCACAA 780

GATTTTQAAG TTTAGTCACA AAACAAATCA TTCAGATTTT TTTCyATAAA TTTAGTTTCA 840

(2) INFORMATION FOR SEQ ID NO: 509:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1373 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509:



TTCTATGAAA CTATCAGCTG TTTCTTTTTC CTTGGTAAAA ATCGTTTTAA CGCCATGATA 60

CATGTATTCT CTTTCAaCAC CCGATTGGTC ATAAGCATCG AAAACACAAA TAATTTCATC 120

TGAAATAACT GCATTATAAT TTGCAATTGC ATCTATTAAT TGCATTCTAG CTTCTTCTAA 180

ATTCTCTTTT GCAATGGCGC TTAGCGTTGG TGATTGTCCT ATCATATTGT ATCCATCAAT 240

GATTAAGTAA CGTTCTTTCA TTATATTTCT CCAACATCAT GTCTTTTTCG AAATACTTCG 300

TACATCATTA AACTTGCTGC AACCGAAGCA TTCAAACTGT TTACATGTCC AACCATTGGA 360

ATCTTAATAT AAAAATCGCA TTTATCACTT ACTAGGCGAC TCATACCCTG TCCTTCGCTA 420

CCAATTACAA TAGCCAATGA CATGTCCGCT TCTAGATTTC TATAATCTGT TGCATTATTA 480

GCTTCAGTGC CAGCTACCCA AAAGCCATTA TCTTTTAGTT CATCGATAGT TTTAGCTAAA 540

TTTGTCACTC GAATAACTGG TACATGTTCA ATTGCACCTG TTGAGGCTTT TGCAACTGTT 600

TGCGTTAGTG TAACTGAACG ACGTTTAGGA ATAATAACAC CATCAACTCC CGTTGCATCG 660

GCTGTTCTTA AAATTGATCC CAAGTTATGT GGGTCTTCTA AGCCGTCTAA TATAAGTACT 720

GTCAATAAAC CTTCTTTTTC TTTTTGCTGT TTTAAAAATT GATCGAAGTC AGCATATTCA 780

TATGGTGCAA TAAGCGCTGC AACACCCTGA TGTGGTGCAT TTGCTAAAAA ATCTAATTTA 840

GATTTTGGTA CAGTTTGAAC AATGATTTTT TGATCTTTTG CArrrrrrAA AATTTCATTA 900

ATTTGTTGCT TTTTAATACC TTCTTGAATC AATATCTTAT TTAyCGGATG CCCAGTAATA 960
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10 



TTCGTTTACT ATTGTTATTA TTTTATTTAA TAATGCCTCT AATOGTTCTT CTCTTTTTTC 1080 

TAAATAaAGA AAACCTATCA CTGCTTCTAT CkCTGAACTT TTACGATATG TTTGAACATC 1140 

AQTGTTTTTA GCTTTAGTAT OACTTTTCGC GTTACGCCCT CGCTTCAAAA TATCCATTTC 1200 

TTCGTCTGTA AACCATTCTT GCTCCATTAA ATATTCTAAC GTTTGCGCCT GACTTTTGGC 1260 

AGATACATAT TTTTTAGACA TTTGATGTAG TTTATTAGGC TTACTTTTAA GCTTTAAAAC 1320 

GATATAGGTA CGTACATATT GATCTAaGAC TGCGTCnCCC ATATATGCTA AGG 1373
(2) INFORMATION FOR SEQ ID NO: 510:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510:
TCAAGTGCCA ACTACACAAC TGATTTACAT TCTTTAGGTC AATATGTACA AGAAGGCCGT 60 
CGCTTCTTAT TCGAAACAGT TGTAAAAGTA AATCATCCTA AATATGATAT TACTATTGAA 120

AAAGATAGTG ATGATCTAGA CGGATTAAAT TATTTAGCTG GTAAAACAAT CGACGAAgTT 180 
AACACAAAAG CATTCGAAGG TACATTATTA GCGCATACTG ATGGTGGTGT TCCTAACATG 240 
GTAGTGAACA TTCCACAATT AGATGAAGAA ACTTTCGGTT ACGTCGTATA CTTCTTCGAA 300 
CTTGCTTGTG CAATGAGTGG ATACCAATTA GGCGTAAATC CATTTAACCA ACCTGGTGTA 360 
GAAGCATATA AACAAAACAT GTTCGCATTA tTAGGTAAAC CTGGTTTTGA AGaCTTGAAA 420 

AAAGrATTAG rAGAACGTTT ATAAAATACA TTACTTCAAA GATTAGTGAA GTTTGAAAAG 480

ATAGAACTAG ACGTTAACTA TTTAAAGCAT ATTTTCGAGG TTGTCATTAC AAATGTAAAA 540

ATGTAATGAC AACCTCGTTT TTATTTATAT GCAAGAACTA GGTTACTAGC TAaTGTGACA 600

AGATGTTAAG AGAAAATTAA aGATAAAATA ATATCTGCCT TACAATAATA TTGTTATACT 660 
ACTAGnGACT GATTTATTAG CATGATTACA TGTTAATGTT TCTTTACTTA GTAATTA 717 
(2) INFORMATION FOR SEQ ID NO: 511: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2700 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



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AATCTAATTT TTCCTCCTAA TCTCOOATTG GATAOTGGTC AATTTTTCTG GCCCGCCATT 60 

TTAGCGTTTG TTCTAACTGG GATTGGTTTA CCATTATTAG GTGTGATTGT AGGTGCACTT 120 


GATAAAGAAG GATATATTGG CGCATTAAAT AAAATTTCAC CTAAATTTTC AATATTGTTC 180 

TTAATCATCA TTTATTTGAC TATAGGACCA CTTTTTGCAA TACCTAGAAC TGCATCTACA 240 

TCTTTTGAAA TGACAATTAC ACCAATTATA CATAGCAATA GTAGTATCGC TTTATTTATA 300 


TTTACGATTA TCTACTTCAT AGTCGTTTTG TATATTTGTT TAAATCCATC TAAGTTAATC 360 

GATCGTATTG GTTCATTATT AACACCATTA TTATTGATTA CTATTTTAGC GATGATTATT 420 

AAAGGATACT TAGACTTTAG CGGTAATAGT GCTGGAAAGG GCAATGAAGC ACTATATCAT 480

TCTAATTTTT CAAGTTTTGC TGAAGGCTTT ACACAAGGCT ATTTAACAAT GGATGCCATT 540 

GCAGCAATTG CTTTTTCAAT GATTGTTGTT AATGCAGTAA AACTAACAGG CATTACTAAA 600 

ACAAATCAAA TATTCAAACA AACTTTGACT GCTGGTTTAA TTGCAGCCGT AGCTTTAATT 660

TTCATATATA TTTCATTAGG TTATATTGGT AATCATATGC CAGTAAGTGA CATGACGTTA 720 

GATCAATTGA AATCCAAAGA TCGAAACATT GGGACATATT TATTAACGAC AATGGCTTCA 780 


ACAGGATTTG GTTCATTCGG AAAATATTTA TTGGGCATCA TTGTGGCGCT GGCATGTCTA 840 

ACTACAGCAT GCGGGCTTAT TGTTGCAGTT TCTGAATATT TCCATAGAAT CGTACCTAAA 900 

GTATCATACA AAGCATTTGT ATTAGTTTTC ATTTTAATGA GTTTTATTAT TGCTAACCAA 960 


GGTTTAAATG CTGTTATCTC AATGTCAATT CCGGTATTAA GCATTGTATA CCCAGTAGCA 1020 

ATAACTGTTG TATTATTAAT TTTAATTGCC AAATTCATAC CGACAAAACG CATTTCACAA 1080 

CAAATTCCAG TTATTATCGT ATTTATATTG TCGATTTTCA GTGTTATTAG TAAGTTAGGT 1140

TGGcTGAAAA TTAACTTTAT AGAATCATTG CCTCTAAGAG CGTATTCTTT AGAGTGGkTC 1200 

CCAGTAGCAA TTATTGCAAC GATATTAGGC TATCTAGTOG GCATATTTGT AAAACAAGAT 1260 

CCAATTAAAT ATCAACAGGA ATAACGAATA ATATAAAAGA GGTTGGGACA TAAATCCCTA 1320

AAAAAACAGC AGTAAGATAA TTTTCAATTA GAAAATATCT TACTGCTGTT CTCTATTTAT 1380 

ACAATACTTC GTATTGAATG GCTTCX3CTTT CCTAGGGTGC CGTCTCAGCC TCGGTCTTCG 1440 

ACTGGCACTG CTCCCTCAGG AGTCTCGCCA TTAATACTAC GTATTAACGT GTAATTTTAC 1500

TTTGAAATAC TTTAAAAAAA TAAGACACTT TGCCCAACTT GCACATAAAT GTAAAATTCA 1560 

ATAAAATAAA TTTTCTGTGT TGGATCCCTT CGTATAATTT AATAAATACT ACTAAACTAA 1620 

ATTAACGAGG TGCCTTATGT ATAAAAATTA TAACATGACC CAACTTACAC TACCAATAGA 1680

AACTTCTGTT AGAATTCCTC AAAATGATAA TACGCGATAT GTTAATGAAA TTGTTGAAAC 1740 

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AAGATATGCA TACCOTAATG ATAGATATAG TTTTAAACGT GACTTCAAGC TATATGAATG i860 

TGATGACTGT TCATCATGTT CTTTOAGACA TCAATGCATG AAGCCAAATT CGAAATCCAA 1920 


TAAGAAAATT ATGAAGAATT ATAATTGGGA ATACTTTAAA GyCCAAATTA ATCAAAAGCT 1980 

TTCTGAACCA GAAACGAAAA AAATCTATAG TCAAAGAAAA ATTGATGTAG AGCCTGTTTT 2040

TGGATTTATG AAGGCTATTT TGGGTTTCAC TCGAATGTCA GTTCGAGGAA TAAATAAAGT 2100

TAAACGAGAG CTAGGTTTTG TATTAATGGC ACTTAATATA AGGAAAATAG CAGCTCAACG 2160 

AGCTGTACAT TATAAAATAC ATATCAAAAA AGCTGATTTC TATCAAATAA TTAATAGAAA 2220 

TCAGCTTTTT TACATTGCCT AAGAATTTAA TGTCCCAAGC CCTTTTTATC GAATAACTTA 2280

TTGTAAACCT TGTCTTTCTT GGTTATTGTT TTCGTTATTT TTTTCGTGTT TTTQTTTCCA 2340 

CTCTTTTTGA GTCATTACAT CGTCAACTTG CATGTTAACT TCAACTACAT CTAAACCAGT 2400 

AATATATTTA ACTTGTTCTT TAACTAATTC AGTTACTTTA CGGAAGATTT TTGGTGCTGA 2460

TTCACCATAT TCTAAGATTA CTTTTAAGTC TACAGCAGCT TGTTTTTCAC CAACTTCAAC 2520 

AGATACACCT TGAGTAACAT TGTTGCCACT TGAGAATGCA TTAGTGAATG TATCAGTTAA 2580 

GCCACCTTTC ATATCTAAGA TACCTTTAAC TTCACGTGCA GCGATACCAG CAATTTTTTC 2640 
AACAACTTCA TCAGAGAATG TTAATTTGnT TTTGAATTGA GGCTCCTGAT TTnGTTCnTG 2700 
(2) INFORMATION FOR SEQ ID NO: 512: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1135 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512: 






ATCAAATACA ATAATTAAAA TAaTGGCTAA TACaCCTAaA ATaATAATcG AGTTaGAGTA 60

GCCTAAGCGA yCACCcTAAC aGTArACATT CTAGGCATAT ATTTaTCTTT AGACATTGAA 120

GCCGCTAACA TCGGAAAAGC TGTAAATCCA GTGTTGGCTG CTAATACCAA AATCATCACG 180

GTCGTTGCCT GTACAAAATA GAAGGCAGCA TTATCACCAA ATATTTGCAT AGCTAATTGT 240

GATAAAACTG TCGTTTCCGT TTGTGGCAAA ATTCCATAAA CATATGCTAA ACCAACGATA 300

CCAACTAATA AAAACGCTAA AATTGAACCC ATAGCAATTA ACGTTTTCAC AGCATTTTTA 360

GCACTTGGCT CTCTAAAATT GGTTACCGCA TTTGAAATAG CTTCAACACC CGTTAACGAT 420

GAAGCCCCTG ATGAAAATGC TTTTAATAsC AAGAATAATG TTACTCCAGG AACCGCAGTT 480






EP0 786 519 A2 






AATAl 1AATA 


TCACTAACCC 


TATAATGAAA AGATATACTG 


GATAGGATAA 


TACC5GTV5GGA 


£. f\ rt 

600 


GATTCAGTTA AACCACGTAA ATTTAATATT AAAATAAAAA GTACAAGTAw ACATGCAATL 660

AGTrCkTTAT GCCCATATAA ACTTGGGAAT GCAGCAACAA ATGCATCAGC ACCAGATGAT 720

ATmCTAACAG CGACAGTCAG TATGTAATCG ACTAATAATG AGCCTCCTGC AAGCAATCCC 780

CATTTTTCTC CTAAATTGGT CTTGGACACC ATATACGCGC CGCCACCTTT AGGATATGCA 840

TAAATAATTT GCCTATAAGA CATAATTAAA GCAGCTAATA AGATCAGAAC AGCACCTGCA 900

ATCGGTAAAG TATACCAAGT TGCAACTGCA CCCACTACTG ATAGTGTAAT CAGTATTTGT 960

TCGGGACCGT AAGCCACTGA AGATAATGCA TCCGACGAAA GAATCGCTAA CCCTTTAAAC 1020

TTCGATATCT TTTCGTCTTT TAGTTCTCTG TTTTTTTTAG GTTGCCCTAT AATAAGtCTT 1080

TTAAATTGAT kGAcATAAtC TCCTATTCCT TTTTATAGTT TTnGATGGAA ATCAC 1135



(2) INFORMATION FOR SEQ ID NO: 513:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 513: 

TTATGGTAAA ACTTTAACAC
TTGAATTTAA GCCATTTCTA
CACATTTTAT TCAGATTTTT

GATATTTTCA AATAATACTC

TTTAGCTAAA TGATAAACTG
AGATAATGCT TGATCCATTG

AGAATGTGTT GATGTGCAAG

CATAGGATCG GCACCTAACT
TAATGCAGCT TTTGCATTAT

GATGATATTT TTCAATGCAC

ACGCAAATAA TCATTCATAA

TGAAGCAGCA ACTGTAGTTG
CAACACGCCA ATACCTGCAT
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AAGCATTACA TAAATCGCAA TCATTTATGT GGCAAAAACG 60 

TTCATTTTAC GTAATTATTA GCCGTATATG TGTAATAATA 120 

TATCGCGCTC CATTAAATCT TTTACGCATT CTTTTACTGA 180 

TATATAATGC ATTTGTAATT GGCATATCCA CATTTTTTTC 240 

ATTTAGTTGT ATAAATACCT TCAACAACCA TATTCATTTC 300 

ATTCACCTTG TCCAAGTTTA TATCCTAATG TGAAATTCCG 360 

TAACGATTAA GTCACCGATA CCACCTAAAC CTAGAAATGT 420 

TTTCACCTAA TCTACTAATT TCCGCTAAGC CACGAGTCAT 480 

CACCGTAGCC AATTCCAGCT ACGATACCAC TTGCTACTGC 540 

CACCAAGTTC AACACCAATC AAGTCATCAT TCGTGTACAC 600 

ATAAATCTTG CGTTAATTTA CTTACACTTT TATCTTTTGA 660 

GTTGCTTGAC TACAACTTCT TCCGCATGAC TTGGCCCTGA 720 

TATATTCAGG TGAAATAGAA TCTTCAATCA TTTCTGACAC 780 
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CAGCTTATCA TTAATTTGAG AAGCAACTTC TCGCATTGCT TTAGTAGGTA AAGCCATTAA 900 

GTAAATATCT GCAAATTG 918 

(2) INFORMATION FOR SEQ ID NO: 514:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 587 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514:

CAATACTCGG TATTATAGAT AGTCCTACTA CATTTATATG GGTTTGTATC ATTACATTAA 60 

TTGCACATCA ACTTGAAGGT AATATCATTA CACCAAATGT AATGGGTAAA TCTTTAAGTA 120 

TCCATCCTTT AACAATTATC GTTGTTATTT TAGCAGCAGG AGATTTAGGT GGCTTTACAT 180

TAGTTCTGAT TGCAGTGCCA TTATATGCTG TACTTAAAAC GGTTGTTAGT AATATTTTCA 240

AATACCGCCA ACGCATTATT GATAAAGCAA ACAGTAATGT TAAGGACTAA TTCTGTGGAT 300

GTCTTTTAAG AATATAAGAT ACTATCGCAT CAAAAGTTGA AACTACAGCT TTTGAGGCGA 360

TTTTTTTGTG CATAAAAAAT CAGTCtnAATG AAATATCAAA TAATTTTCCA TCAGTCCGAT 420

TATTATAAAA GCAAAAAAGC TTTGCTCACA TATATAATAA CGTGAACAAA GCTGTTGAAT 480

GATATTATTT AATTGCGTGG AATCCGCTAT CTACATGAAT ATTTTCACCT GTAACGCCAC 540

TTGATAAATC ACTTAATAAG TAAGCCGCAG TTTTACCTAC TTCTACT 587 
(2) INFORMATION FOR SEQ ID NO: 515: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515: 

GGTTCATTCC AAAAAAGTAC GCGATAATTA GCGAAGAAGA ATTTGAAAAT TTAAATGTTG 60 

TTAAACCAAA TAAAAATAAT GTTTTCTGGT CAGTTATAGG AAGTTCGGTT TTGTTTGGAG 120 

TTACTTTAAG GAAATACATA CATGTTTTTG ATGTTCAATT AGATAAGCTA GTTGTAATGA 180 

TATTGTGTGC TCTCGCTTTA ATTTGTGTTA TAGTTTTTTA TTTTAACTTA AATAGAAAGC 240 

TTAAGTTAAA AGTGTTTGAT ACAAATATTG AAAAAnATAA GAGAGTTATA TAAAwAcCAA 300
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10 






TTTCATTAAT TGCCCTTATG ACAATCGAAC CTCAAAATAT AATAATATTT ATTTATTGGA 420 

TTATGATGAC AATGCTTTTC TTTTTGTTAA ATATGACTTC GATAGGTAAT GAAAAAGTTC 480 

GCGTTATAAT GAAAAATAAT TGATTACATT TAAAATATTC TAAATGTTGT CGACACAATC 540 

CTTTTAAGAC GCTAGTAGAA TTTAAATGAC TTCTAATGTA TATGAAAGTG TATCAATATA 600 

AAACCAATTG AAAAGAAGTG GAGACATTGC TTTGTGAAAC TGAAAATATT AATAAGAATC 660 

CCAAATATAG AATTATCAAA TACAAAGATG AATATTTGAT GATTGATTTA GTAAGTACAT 720 

GGTTAGCACT CTTTTTCCCA ATGATTAATT GGCTGATTCC AAAAAAGTAC GTCAAAATCA 780 

GCGAAAAAGA TTTTGAAACT TTAAACATTG TG 812 
(2) INFORMATION FOR SEQ ID NO: 516: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 526 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 516:

GTTCTATTAT TTCTTCCAAT GGTAATGGAT TTTCGTAAAA TGATTGATAT AAGTTGATAA 60 

TCTCAATAAC TTTTAATTGA TCTGGAAACA TCGTTTTTTG GAACATTATA CTGCGATTTT 120 

CACTTTGTAA TAACTTGTCT TTATCAAATA TCTCACCAGA ATTAGCATTA ACATTACCAA 180 

TTAATATATC AATTAACGTT GACTTTCCAG CACCATTTTT TCCAATTAAA GCGATACATT 240

TACCTTGTTC AATATCGAAC GAAATATTTT TTAGAACACA TCTTTTATTA AATGACTTGT 300

TGATATTAGA TATTTGAATC AATTTAATCA CCTCTATTTT TTTCTTAATT TAATATTAGT 360 

AAATTTATTA GATTTAAAAT AGAACACTTT GTCATAGATT TGAAATGACA AATGTCATTA 420 

TTAGATkTAC ATAATATATT TATCGTtATT TTAATTTTGG GCAAAATAAA AAGAGCCTCT 480

ATAATCGrGc TCCTTACAAA TAaATTATAA AAttGGCGAA CTAAAT 526 
(2) INFORMATION FOR SEQ ID NO: 517: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517:








GGAACACCTA AAGAAAGAAA TGATGCATTA AACACAOAGG CTGATATCTA TGTAACCAAT 120

AAAQAAAATA CTAAATGGTT ATGCGATCAA TATAAAAAAG AATGGCCATT TGATATGGTT 180

GTAATTGATG AACTGTCTAC ATTTAAAAGT CCTAAGAGTC AAAGGTTTAA ATCTATTAAA 240

AAGAAATTAC CACTCATTAA TAGATTTATA GGATTAACAG GAACACCTAG CCCAAATAGT 300

TTACAGGATT TATGGGCTCA AGTTTATTTG ATAGACAGAG GCGAAAGACT TGAGTCTTCA 360

TTCAGTCGTT ATCGAGAAAG GTACTTTAAA CCAACACATC AAGTTAGCGA ACATGTTTTT 420

AACTOGGAGC TAAGAGACGG ATCTGAAGAA AAGATATATG AACGAATAGA AGATATATGT 480

TTAAGCATGA AAGCGAAAGA TTATCTAGAT ATGCCTGACA GAGTTGATAC TAAACAAACA 540

GTAGTCTTAT CTGAAAAAGA AAGAAAAGTA TATGCAGAAT TAGAAAAAAA CTATATTTTA 600

GAATCGGAAG AAGAAGGAAC AGTTGTAGCT CAGAATGGGG CATCATTAAG TCAAAAACTA 660

CTTCAACTAT CTAACGGTGC AGTTTATACA GATGATGAAG ATGTAAGACT TATACATGAT 720

AAGAAGTTAG ATAAGTTAGA GGAAATTATA GAGGAGTCTC AAGGCCAACC AATATTATTG 780

TTTTATAACT TCaAACATGA TAAAGAAAGA ATACTTCaAA GGTTTAAGGa AGCAACCACA 840

TTaGAGGATT CAAACTATAA AGAACGTTGG AATAGTGGAG ACATTAAGCT GCTTATAGCA 900

CATCCAGCAA GTGCAGGGCA TGGATTAAAC TTACAACAAG GTGGGCACAT TATTGTTTGG 960

TTTGGACTTA CATGGTCATT GGAATTATAC CAACAAGCAA ATGCAAGATT ATATAGACAA 1020

GGACAAAATC ATACGACTAT TATTCATCAC ATTATGACCG ATAACACAAT AGATCAAAGA 1080

GTATATAAAG CTTTACAAAA TAAAGAACTA ACGCAAGAAG AATTGATGAA AGCTATTAAA 1140

GCAAGAATAG CTAAGCATAA GTAATGGAGG TATAAGATGG GAAAGGCATC ATACGATATT 1200

AAGCCAGGTA CATTTAAATA TATTGAGTCA GAGATATATA ACCTACAAGA GAACAAGAAA 1260

GAGATAAATA GATTGAGAAT GGAGATACTT AACCCAACGA AAGAGCTAGA CACTAACATT 1320

GTGTATGGAC CGTTGCAAAA AGGTGAACCA GTTAGAACAA CTGAACTAAT GGCAACAAGG 1380

TTATTGACTA ATAAGATGTT ACGAAACCTA GAAGAAATGG TCGAAGCAGT TGAAAGTGAA 1440

TACTTAAAGT TACCTGAAGA TCATAAGAAA GTAATTAGGC TAAAGTATTG GAATAGAGAT 1500

AAGAAGCTAA AGATAGAGCA AATAGGAGAT GCATGTCACA TGCATCGTAA TACAGTTACT 1560

ACTATACGAA AGAACTTTGT TAAAGCGGTA cGwtATCATG CAGGTATCAA ATAACATTGT 1620

GCAAAGATTG TGCAAAAGGC CTACAAATCT GTAGTAATAT GATAGTATCG GAAAGATGTA 1680

TAAAGTTATC TAAAAGTTAT ACGACACAAG TACACGAGGC ACATCGCTAT GCGtGTGTCT 1740

TTTGTTATGC AATCAAAGAG GTGTAAGAGA TGACCAAGCA TAATAACATT TATAAGCATG 1800
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AGATAGCATT 


AGATAGGGAT 


AATCATCTTT 


GTCAAATGTG 


TTTAOGTGAA 


GACATAGTAA 


1920 




CAGATGCAAA CATAGTGCAT CATATTATTT ATGTTGATGA AGATTTTAAT AAAGCTTTAG 1980

ACTTAGATAA TTTGATGTCA GTTTGTTATA GCTGTCATAA CAAAATTCAT GCAAATGATA 2040

ATGACAAAAG TAATCTTAAG AAAATTAGAG TATTAAAAAT TTAAATAAAA AAATAATTTA 2100

TTTTTATAGC CCCCTACCCA TCGGCTTAAA ATGTTTTTTC GACGGGTACC GGCGGGGGCC 2160

CTTCGCTTGC AACGCGGATA AACTTTTATG AAAGGGGGTC TTTATATGAA ATTAACAAAA 2220




AAACAGCTGA 


AAGAATATAT 


AGAGGATTAT 


A_AAAAATC!TG 


ATGACATATT 


AATTAATTTG 


ion a 


15 


TATATAGAAA 


CGTATGAATT 


XTATTGTCGG 


TTAAGAGATG 


AACTTAAAAA 


TAGTGATTTG 


« J* u 




ATGATAGAGC 


ATACAAACAA 


GGCTGGTGCG 


AGCAATATTT3 


TTAAGAATCC 


ATTAAGCATA 






GAACTGACAA 


AAACAGTTCA 


AACACTAAAT 

r%n*mrm\* x raw & 


AAt "i"i'A i " i 'f^A 


AGTCTATGGG 


TTTAACTGCA 


z4oU 


20 


GCACAAAGAA 


AAaAGATAGT 


TPAAGAAGAH 




GTGACTATTA 


AAGTT I" 1 'AAA 
X X x /vvv 


2520 




TGAACCTTCA 


CQAAAACTTAT 




<rr AT/in Afi an 


CAAGTCACTC 


AAGGGAAAAT 


2580 




AAAAACAAGC 


AAAT A TRTT 21 


#WW%rl\s/vfcX Vj 


*IY2 Af2 A/^! A A 


CTTAGATATC 




264 0 


25 


AGGTAAATGG 


fiTATTTY^ATfi 

U1A1 X iUnlVJ 


AAfiAATTAftP 


' 


ATTCGATTCA 


TAGAAAAGTT 
x-fw&nnjmwj x x 


2700 




TTGTAAACCT 


x v» wwvwvn x 


r , TAAArY2*rr , A 


x\V^ X X\sXx\X Xn 


CAACCATGGC 


AACATTTTAT 
***^s»*»x x x mi 


2760 




TATTGGCAGT 


ilvl X X VJSJ X X 


Cid(W r pr*tx r r A A 


An A AAfA A A A 


CTGCGCAGGT 


TTAmAGAAGC 


2820 


30 


TTT(3ATATTT 

» • * *X*i fcX A A 


ATGGGGCGAA 


AAAATYlfSTJX A 




ATATCTGGTG 


TTGCTAACTA 


2880 




TGCTGTTTCT 


CAAGATGGAG 


A*AAfV3GCGC 


Tfi AA A TrrnT 


CTTTTAGCAA 


ACGTAATGAA 


2940 


35 


ACAAGCTAGG 


ATTCTATTTG 


ATGAATCTAA 




AAAGCTAGCC 


CAAAGCTTAG 


3000 


AGAAAATTTT AGACCTTTGA GAGATGAAAT TCATTACGAT GCAACTATAT CTAAAATTAT 3060

GCCACAGGCT TCAGACAGTG ATAAGTTGGA TGGTTTAAAT ACACATATGG GCATTTTTGA 3120

TGAAATTCAT GAATTTAAAG ATTATAAATT GATTTCAGTT ATAAAAAACT CAAGAGGGGC 3180






AAGGTTACAA CCCCTTCTTA TCTACATTAC GACAGCAGGG TACCAACTAG ATGGACCACT 3240

TGTTAATATG GTAGAAGCGG GAAGAGACAC CTTAGATCGA ATCATCGAAG ATGAAAGAAC 3300

TTTTTACTAT TTAGCTTCTC TCGATGATGA CGATGATATA AATGATTCGT CGAATTGGAT 3360

TAAAGCAAAT CCTAACCTAG GTGTTTCTAT CGATTTAGCT GAAATGAAAG AAGAGTGGGA 3420

AAAGGCTAAG AGAACACCAG ATGAACGTGG AGATTTTATA ACCAAAAGGT TTAACATCTT 3480

TGCTAATAAT GATGAGATGA GTTTTATTGA TTATCCAACA CTTCAAAAAA ATAATGACAT 3540

TATTTCCTTA GATGAGTTGG AAGGTAGACC ATGTACTATA GGTTATGATT TATCAGAAAC 3600
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AACACATTCT TGGATTCCTA AGCATAAAGT TGAATATTCT AACGAAAAGA TACCCTATAT 3720 

AGAATGGGAA GAAGACGGAT TACTAACAAT ACAAGATAAT CCTTATATAG ACTACCAAGA 3780 

TGTTTTAAAT TGGATAATAA AGATGAATGA GCATTATGTT GTCGAAAAAA TCACTTATGA 3840 

TAGGGCGAAT GCTTTTAAAT TAAATCAAGA GTTAAAGAAT TATGGCTTTG AAACAGAAGA 3900 

AAcwAGACAA GGGGCTTTGA CCTTGAGCCC TGCaTTGAAG GATCTAAAAG AAATGTTTTT 3960 

AGATGGGAAA ATAATATTTA ATAATAATCC TTTAATGAAA TGGTATATCA ATAATGTTCA 4020 

GCTGAAACTA GACAGAAATG GGAACTGGCT GCCATCTAAA CAAAGCAOAT ATCGTAAAAT 4080 

AGATGGTTTT GCAGCATTTT TAAACACATA TACAGATATT ATGAATAAAG TTGTTTCTGA 4140 

CAAGGGTGAA GGAAACATAG mATTTATTAG TATTArAGAT ATAATGCGTT AAGGAGGTGA 4200 

ATGTTATCGC AAAAGAGAAT ATTGTCACAC GCATAAAGAA AAAATTGATA GACAATTGGA 4260 

TTGaTCAGTC AGCTTCTAAG CTTTATGACT TTAGCCCATG GAAAAATAAA TCTTTTTGGG 4320

GTGTAATCAA TAATACGCTT GAAACTAATG AAACGATATT TTCAGCTATT ACnAAGTTAT 4380 

CTAATTCGAT GGCTAGTTTG CCCTTGAAAA TGTATGAAGA TTATAAAGTA GTTAATACAG 4440 

AAGTATCTGA TTnACTTACA GTGTCACCGA ATAATTCTCT GAGCAGTTTT GATTTTATTA 4500 

ATCAAATTGA AACAATCAGA AATGAAAAAG GTAATGCATA TGTG 4544 
(2) INFORMATION FOR SEQ ID NO: 518: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 881 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518: 

CATGGCAGAA TATTGAAGCT GTAAAAAAAG GACATGTAAT TTCATATAAA GCAGAAGATT 60

ATTGGTTCAC AGATCCTATT ACATTAGAAC ATTTGAGAAG TAAATTAAAA AAAGAAATTT 120

TAAATAAAAA ATAATAGAAA TAAGTTGTAA AAATTTTCTT ATGCATTGGT ACTAATGTTT 180

TTAAGGAGTG ATTAAATGAA GCAACTGGTT GGAATTCCCG AATCAATGTT AATTCCTTTG 240

ATAGCTCGAG CAAAAGAGTA CGAAAACGAA AAACCAATAA TAAAAGACGC ACTATCAAAA 300

AAAATATTTG ATGGTTTAGA TGATATGTAC AAAAATGTTA CATGTGATGA CATGTCTCAA 360

ATTGGAATTA GTATACGTAC TGTGATAATA GATTGTGTTA CTAAGAGGCT TATCAAGGAT 420

AATAAAGATT TAATCGTGGT CAATATAGGT TGTGGCTTAG ATACAAGGTT TCAAAGATTT 480
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ACATTTTTTA AAGAAAGTAA TAGTTATAAG ATGATATCTA AATCTATGCT AGATTACAGT 600 

TGGATTGATG ATGTCAAAAA TTATAAATTT TTTAATAGTA AGTCAGATAT ATTGTTTATT 660 


ATTGAAGGTG TATTGaTGTA TTTTGATGAG AOTGTAATGA CTCAATTATT GGACACTATT 720 

ATCAAAAAGA TGGGAGATCA TAATTTGACA TTTGCGATTg AATTTTGCTC aAAAACAATT 780 

GCGAATAATA CmAAGAGACA TCAATCGGTA TCCAAGTTAT CCTCACCACC TGTTTTTAAA 840 


TATGGGTACC ATGATTTAnA AAAATTGGAT GAnATTTACC C 881 
(2) INFORMATION FOR SEQ ID NO: 519: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3122 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519:






TGAATaaAAA TATATTAATA GATAAACACA AATGTGTCcA AATACCCCTA GAGGTATTTG 60

ACnAGTTCCA TCCAACTGTT TAAAATACCC CTACAGGTAT TTTTAGGGAG GTTATTATGA 120

AACAATACGG AGAAAaGTTT ATCGATGAAT TTAGTAAAGC AGAATTGGAA AAACTAGCCA 180

AGCAAGGGCA ATTAATTGAC GTTAGAACAG AAGAGGAGTA TGCATTAGGA CATATCAATG 240

GTTCCATACT TCATCCTGTT GATGAGATTG AGTCATTCAA TAAAGAAAAA AATAAAACCT 300

ATTATGTAAT CTGTAGAAGT GGTAACAGAA GTGCTAATGC TAGTAAATAT TTAGCTAAAC 360

AAGGTTATAA CGTTATAAAT CTTGATGGTG GTTATAAAGC TTATQAAGAA GAAAACGATA 420

GTTATGATAC ACAAGAAGAA TATAAAAGTA TAGAAATTAA AGGAGATCGT AAACAATTTA 480

ACTATCGTGG TCTTCAATGT CCAGGGCCAA TTGTAAAAAT TAGTCAAGAA ATGAAGAATA 540

TTGAAGTAGG TGACCAAATT GAAGTCAAAG TCACAGACCC TGGATTCCCT AGTGACATTA 600

AAAGTTGGGT GAAACAAACA AGGCATACTT TAGTTAAGCT TGATGAAAAT AACAATGGAA 660

TTAATGCGAT TATTCAAAAA GAAAAAGCAA AAGATTTAGA TATAAATTAT TCTGCTAAAG 720

GTACTACAAT TGTATTATTT AGTGGAGAAT TAGACAAAGC TGTAGCAGCG TTGATTATTG 780

CAAATGGTGC TAGAGCTGCT GGAAAAGATG TAACTATCTT CTTTACTTTT TGGGGGCTTA 840

ATGCATTAAA AAAAGTGCAA ACAGTTAATG TTAAAAAGCA AGGTATTGCA AAAATGTTTG 900

ATTTAATGTT GCCCAAAAAG AATATACGAA TGCCTCTTTC CAAAATGAAT ATGTTTGGTT 960

TAGGAAATAT GATGATGCGC TACGTAATGA AAAAGAAAAA TGTTGATTCA TTACCAACAC 1020
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10 



TCATGGGTAT TCAGAAAGAA GAACTTAGAG ATGAAGTTGA GTACGGTGGT GTAGGCACTT 1140 

ATATTGGTGC TACTGAAAAT GCGAATCATA ATTTATTTAT CTAATTAAAT CTATTAATAA 1200 

AAGGAGTTGT TATCATGTTT TTTAAACAGT TTTACGATAA TCATTTATCT CAAGCATCAT 1260 

ATTTAGTGGG TTGTCAACGT ACAGGAGAGG CAATAATAAT AGACCCTGTT CGTGATTTAT 1320 

CGAAATATAT AGAAGTTGCA GATTCTGAAG GTTTAACAAT TACACAAGCT ACAGAAACAC 1380 

ATATTCATGC TGATTTTGCT TCAGGAATTC GTGATGTGGC TAAACGCTTA AATGCAAATA 1440 

TATATGTGTC TGGCGAAGGT GAAGATGCAT TAGGGTATAA AAATATGCCA TCAAAAACAC 1500 

AATTTGTTAA ACATGGAGAT ATCATTCAAG TAGGCAATGT TAAATTAGAA GTTCTGCATA 1560

CTCCAGGACA CAOGCCTGAA AGTATTAGCT TTTTACTCAC TGATTTAGGT GGTGGTTCAA 1620 

GTGTTCCGAT GGGATTATTT AGTGGTGACT TTATTTTTGT TGGTGATATA GGTAGACCTG 1680 

ATTTAtTAGA AAAATCTGTT CaAATAAAgG GTTCTACAGA AATTAGCGCG AAACAAATGT 1740

ATGAGTCCGT TCAAAATATT AAAAATTTAC CAGACTATGT TCAAATCTGG CCGGGTCATG 1800 

GTGCTGGAAG CCCTTGTGGT AAAGCATTAG GTGCCATACC TATATCTACA ATAGGTTATG 1860 

AGAAAATTAA TAACTGGGCA TTTAATGAAA TTGATGAGAC TAAATTTATT GAATCATTAA 1920 

CATGAAATCA ACCAGCACCA CCGCATCATT TTGCACAAAT GAAACAAGTT AATCAGTTTG 1980 

GTAf'GAATTT ATATCAATCA TATGATGTTT ATCCTAGTTT AGATAATAAG AGAGTAGCAT 2040

TTGATCTTCG TAGCAAAGAG GCCTTTCACG GTGGCCACAC AAAAGGAACA ATCAATATAC 2100 

CATACAACAA AAACTTTATT AATCAAATTG GTTGGTACTT AGATTTTGAA AAAGATATAG 2160 

ATGTAATTGG AGATAAATCT ACTGTTGAGA AAGCGAAACA CACTTTACAA TTAATTGGGT 2220 

TTGATAAGGT AGCAGGCTAT CGTTTGCCAA AATCAGGCAT TTCAACCCAG TCCGTTCATA 2280

GCGCTGATAT GACAGGTAAA GAAGAACATG TATTAGACGT ACGTAATGAT GAAGAGTGGA 2340 

ATAATGGACA CTTAGATCAA GCAGTTAATA TTCCGCATGG TAAATTATTA AATGAAAATA 2400

TTCCTTTTAA TAAAGAGGAT AAAATATATG TACATTGTCA GTCAGGTGTT AGAAGTTCAA 2460 

TTGCAGTGGG TATATTGGAA AGCAAAGGTT TTGAAAATGT GGTGAATATT AGAGAAGGCT 2520 

ATCAAGATTT TCCAGAATCA TTAAAATAAT TTAAGGATGT GGAAAAAATG AATAAGCATT 2580

ATCAAATTGT TATTATTGGT GGCGGTACAG CAGGTGTTAC CGTAGCATCA AGACTATTAA 2640

GAAAAAATCA AAACTTAAAA GAGAAAATAG CAATTATAGA TCCAGCAGAC CATCATTACT 2700 

ATCAACCATT ATGGACGTTG GTTGGTGCAG GGGTATCTAG TTTGAAAAGT TCTCGTAAAG 2760 

ATATGGAAAG TGTTATACCT GAAGGTGCTA ACTGGATAAA ACAGGCTGTT TCAAGTTTTC 2820 
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TAGTAGCTCC AGGATTACAG ATTAATTGGT CTTCAATTAA AGGACTAAAA GAAAATATAG 294 0 

GTAAAAATGG TGTTTGCTCT AACTATTCAC CTGACTATGT TAACgAAACT TGGAACCAAA 3000 

TTTCTAATTT TAAACAAGGA AATGCCATTT TTACGCATCC AAACACTCCT ATAAAOTGTG 3060 

GAGOTGCgCC TATGAAAATT ATGTATTTAG CTGAAGATTA TTTTAGGAAA CATAAAATCC 3120 

GT 3122 



(2) INFORMATION FOR SEQ ID NO: 520:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3982 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 520:






ATAnAGATAT AoATnAATAT ATTGAGGTCA AAOGATGATA ATTAAAATTT TAACAATTCT 60

ATTACTACTT TGTATATTGA GCTATTTGGT TACAAATAGA AAGAAGCCTT TTCTGTTCTT 120

AAAGACACTC TTTATGGGTG TGGTATTTAT CTTTATAGGA TATATTTCAC TGGCAATATC 180

TGCOGTAATT ATTTATGGTA TTATTCAATT TATCACAATT GATTTTGGTA GTTTTTTCTT 240

AATGGGTATT ATATTGATCT TGATTTCAAG TATATTCCAA TTATTTATAG TTAGATTACT 300

TTTTAGAAAA AAGAATGTCG ATTTGACAGA GGTTGTCGTT TTAsAsCATT TAATTCAATG 360

GTTCTTAGTT TACTTTGCGA TCTATCAAGC AGTAAATGAA AAAATGGACA TTAATGATAT 420

TAATATCGAC AATTTCCAAT CTGTCTTTTT TGACGTGTCT AATTTGAATT TAGTAATTCT 480

ACCAACGTTA ATCATTAGCT GGGTCACAAT ATTTAACTAT AGAATGAGAA GTTACAAATA 540

AAATCTATGA GATTATACCT TCAGACACCA ACATTCAAAT GGTGTCTTTT TTGTTGTGTG 600

GTTTTATTTT TGAAATTCGA AAAAGTAGAG GCATGAATTT TTTGACTAGT GTATAAGTGC 660

TGATGAGTCA CAAGATAGAT AGCTATATTT TGTCTATATT ATAAAGTGTT TATAGTTAAT 720

TAATAATTAG TTAATTTCAA AAGTTGTATA AATAGGATAA CTTAATAAAT GTAAGATAAT 780

AATTTGGAGG ATAATTAACA TGAAAAATAA ATTGATAGCA AAATCTTTAT TAACATTAGC 840

GGCAATAGGT ATTACTACAA CTACAATTGC GTCAACAGCA GATGCGAGCG AAGGATACGG 900

TCCAAGAGAA AAGAAACCAG TGAGTATTAA TCACAATATC GTAGAGTACA ATGATGGTAC 960

TTTTAAATAT CAATCTAGAC CAAAATTTAA CTCAACACCT AAATATATTA AATTCAAACA 1020

TGACTATAAT ATTTTAGAAT TTAACGATGG TACATTCGAA TATGGTGCAC GTCCACAATT 1080
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TCAAAATCTT GTGAGAGAAT TTGAAAAAAC ACATACTGTC AGTGCACACA GAAAAGCACA 1200 

AAAGGCAGTC AACTTAGTTT COTTTGAATA CAAAGTGAAG AAAATGGTCT TACAAGAGCG 1260 

AATTGATAAT GTATTAAAAC AAGGATTAGT TAAATAAAAC TTCAATOGTT GCTGTTATCT 1320

GGAAATAATT AATTAAATGT TATGTTAATT TTTGTTAATG AAAAAAGTAA TCTATTTAAT 1380

GACAGGTTAA TGTAATTGTC CTGAAATTGA CTATATACTC AGTAAGTATC AATTTTAAGG 1440 


AGAGCTTATA ATGAAATTTA AAAAATATAT ATTAACAGGa ACATTAGCAT TACTTTTATC 1500 

ATCAACTGGG ATAGCAACTA TAGAAGGGAA TAAAGCAGAT GCAAGTAGTC TGGACAAATA 1560 

TTTAACTGAa aGTCAGTTTC ATGATAAACG CATAGCAGAA GAATTAAGAA CTTTACTTAA 1620 

CAAATCGAAT GTATATGCAT TAGCTGCAGG AAGCTTAAAT CCATATTATA AACGTACGAT 1680

TATGATGAAT GAATATAGAG CTAAAGCGGC ACTTAAGAAA AATGATTTCG TATCAATGGC 1740 

TGATGCTAAA GTTGCATTAG AAAAAATATA CAAAGAAATT GATGAAATTA TAAATAGATA 1800

ATAAATAAAA CAGGTTGAGA CAAAAAATGG TCTTAACCTG TTTTCAATTT GCATATGTGA 1860

TAAATTCTAT ATCAAAATGC TTATGTATAA TGAATGACAT TTAAAAGTAG GGGAGACAAA 1920 

TATAAATACA ATAGTTCCTA GGATTACTCT CAAAATAACT ATATCAATTA TTTACTTTGC 1980

TCTCCTATTT TTTAAAATAT GTACATGTTT AAACAATCAA AAGTGTACAA TATTAAATTA 2040

TCATTTCCAG TTCTAGTGCT ATATTGGTAG TAGTTGACTA AATGAAAATA AGCTTATAAC 2100 

AAGTTTTTTC AATACTCGTG GGGCCACAAC AGAGAGAAAT AGGATCACCA ATTCCAACAG 2160

ACAATGCAAG TTGGCGGGGC CCCAACATAG AGAAATTGGA TCACCAATTT CAACAGACAA 2220 

TGCAAGTTGG GGTGGGGCCC AACACAGAAG CTGGCGAAAA GTCAGCATAC AAAAATGTGC 2280 

AAGTTGGCGG GGCCACAACA GAGAGAAATA GGATCACCAA TTCCAACAAA CAATGCAAGT 2340

TGGCGGGGGC CCCAACATAG AAGCTGGCGA AAAGTCAGCT TACAAAAATG TGCAAGTTGG 2400 

CGGGGCCCCA ACATAGAGAA ATTGGAACCC CAATTTCTAC AGACAATGCA AGTTGGGGTA 2460 


GGACATCGAT AAAGAAATAC TTTTTCTTTA GCAATTAGTA TTTCTTATGC ATGAGCTTTA 2520 

CTCATGTATT CATTTTTTAA GTACaCATTA GCTACAGCTA ATGATAAAGA ACCACTACAT 2580 

AATAAATCAT TAGTGGTTCT TTATCATTTC TATCTCACTC TTTTACTGGA AGAAAAAGTT 2640

TACGTTTGTA GAACATGCCA CAATACCAAA AATAATTAAG AAAAATAAGA CGATAAGCAT 2700 

GATGACACTT TTCAAACAAC CTCTATCAGT TTCTTTCGAT TTTCTTTGTT GAACCTTTTT 2760 

ATAATCTTCA AGTAGTTTTG CGGCTTTTTT ATTTATATGT TTATTCATGA TGTTGACTCC 2820

TTATAATATA TGTTTAATTC ATTAAAATAG TTGAAAACAT GACTTGAAAT AAAGATATAA 2880 
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AATTTGATGA TGATATTTGC TTTTTATTTT CCAAATGGAA TTTACTTAAA CTGATGCATT 3000 

AAAATATTAA TGAAGCACTA GAATACATAA ATGAATAGTa ATGGTGcACA GTATAGAATA 3060 

ATTAAGGCTA TATTAAGTAT AAATATCGTT AACTGTAAGC TATCTTTAGT TTTAATATAA 3120

ACTATTAGGA TAATCGACGT AAGAAGAATC ATATATATTA ATGATGAAGA AGTCCATACA 3180 

AAATCCGCAT CATTTGTTGT TAATAATGGG ACTATAATTA ATCCGAAATT AATCATGCAT 3240

GCTATATATA CTATAATGTT ATACACAATG TTAATTTTTG TTCACCACCT TATACTTCTA 3300

TTTTAAAAAC TTCTTTATAA TGATATATTG TTTAATGTTG AAATAATTAG ATTATCTAAT 3360 

TTTCATTTGC TTTACATGTA AAAGGCTATA TATAGTATGC TCTTTATGAT TCTAAATGCT 3420 


TTTTAATATT TAATGCTCAT CAACATTTGG ATTTTGAATA TTCAATTCAA AAACTTTATT 3480 

AGCTAOGTCA ATTGTAAAAT CAGAACCATA GTTGACATGA GCTACTTTTA ATTTTCCATC 3540 

TAAATAATAG ATTGCGATTG CAACATCGTA AAATTCGTCA ATGACAAATA AACTCTTTTC 3600 


GTTTGTTACA ACCTCATGCT CTCCTGAGTA TACAACGTTA ATTTCCCAAT CATTAAAAAC 3660 

CATTTCTTAA CCTCCTTGAA CATTTAAATT GATTCAACTT AAGTTTAACT TATTCATACA 3720 

ACTTCGTACA ATATCTAGAT GAACATTAAT TGTATTTCTA GAAATCTTTT TCAATTATAT 3780

GTACTAATTA TACTTTTAAA TTTCTTATTT CAGTATAGTT TTAAAACGAT TTTAAAATAA 3840 

TTCTGCAAAT ATATTAACAC ATAATGTGTT CAAAAAGTTT TGAACAATTT CAAAACTTTT 3900 

ATATAAAGGG nTTGACAACA TGGATTCAAA TnTCTTATTT TAAAAATTAC CTCATATAGT 3960

GTCATGTTAG CCAATTTTTA AG 3982 
(2) INFORMATION FOR SEQ ID NO: 521: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1353 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521:
AGCTTGGaTG ATTTAATaTG GtCCtTTCCC AaCcTTAGAT AATGAAAGAT tTGATAATCC 60 


TGAATATAAA GAAGCTATGA AAAAATATcA ACAGAGATTT ATGGCTGmAG ATGAGGCTTT 120 
GAAGAAATTT TTTAGTGAAG AGAAAAAAAT aAAAAATGGA AATACTGATA ATTTAGATTA 180 
TCTAGGATTA TCTCATGAAA GATATGAAAG TGTATTTAAT ACTTTGAAAA AACAAAGTGA 240

GGAGTTCTTA AAAGAAATTG AAGATATAAA AAAAGATAAC CCTGAATTGA AAGACTTTAA 300 
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GTTAGGTAAA 


ACATTTTATC 


AAAACTATAG 


AGATGATGTT 


GAAAGTTTAT 


ATAGTAAGTT 


420 




AGATTTAATT ATGGGATATA AAGATGAAGA AAGAGCAAAT AAAAAAGCAG TTAACAAAAG 480

GATGTTAGAA AATAAAAAAG AAGAGTTAGA AACCATAATT GATGAATTTT TTAGTGATAT 540

AGATAAAACA AGACCTAATA ATATTCCTGT TTTAGAAGAT GAAAAACAAG AAGAGAAAAA 600

TCATAAAAAT ATGGCTCAAT TAAAATCTGA CACTGAAGCA GCAAAAAGTG ATGAATCAAA 660

AAGAAGCAAG AGAAGTAAAA GAAGTTTAAA TACTCAAAAT CACAAACCTG CATCTCAAGA 720

AGTTTCTGAA CAACAAAAAG CTGAATATGA TAAAAGAGCA GAAGAAAGAA AAGCGAGATT 780

TTTGGATAAT CAAAAAATTA AGAAAACACC TGTAGTGTCA TTAGAATATG ATTTTGAGCA 840

TAAACAACGT ATTGACAACG AAAACGACAA GAAACTTGTG GTTTCTGCAC GAAGAAAGAA 900

ACCAACATCA CCGACTACAT ATACTGAAAC AACGACACAG GTACCAATGC CTACAGTTGA 960

GCGTCAAACT CAGCAACAAA TTATTTATAA TGCACCAAAA CAATTGGCTG GATTAAATGG 1020

TGAAAGTCAT GATTTCACAA CAACGCATCA ATCACCAACA ACTTCAAATC ACACGCATAA 1080

TAATGTTGTT GAATTTGAAG AAACGTCTGC TTTACCTGGT AGAAAATCAG GATCACTGGT 1140

TGGTATAAGT CAAATTGATT CTTCTCATCT AACTGAACGT GAGAAGCGTG TAATTAAGCG 1200

TGAACACGTT AGAGAAGCTC AAAAGTTAGT TGATAATTAT AAAGATACAC ATAGTTATAA 1260

AGACCGAATA AATGCACAAC AAAAAGTAAA TACTTTAAGT GAAGGTCATC AAAAACGTTT 1320

TAATAAACAA ATCAATAAAG TATATAATGG GCA 1353



(2) INFORMATION FOR SEQ ID NO: 522: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1987 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522: 



GTCTGAGTCA GGTGCTGTTT GGTTAGATGC TGAAAAAACA AGTCCTTATG AATTTTATCA 60 

ATTCTGGATT AATCAATCAG ACGAAGATGT AATTAAATTC TTnAAATACT TTACTTTCTT 120

AGGAAAAGAA GAAATTGATC GCTTAGAACA ATCTAAAAAT GAAGCACCGC ATTTACGTGA 180 

AGCTCAAAAA ACATTAGCTG AAGAAGTAAC TAAATTTATT CATGGTGAAG ATGCATTAAA 240 

TGATGCAATC CGTATTTCAC AAGCATTATT TAGTGGTGAT TTAAAATCAT TATCAGCGAA 300

AGAATTAAAA GATGGATTTA AAGATGTGCC TCAAGTGACA TTATCAAATG ACACAACAAA 360 
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(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 523:





TTAGAAGTCA AATCATTAnT 


GGCGTCnTAT 


CGAGTATACT 


ATTAArTTPA 


ft f ^'Vft*T*I 1 *I 1, Pft 
nLinl X X X MVJ 


D U 


10 


CAATTGCATA 


TATTTTAATG 


TGGTTTAACG 


GCCATATGAC 


CtV X*vtw X X X Vj 


& ^^^*^P^^ft ft ^^^5 ft 


14 O 




CAATAATTAC 


AAGCTGTTTA 


ACCTTATTAA TATGTAGTAT 


TTTTATTAAT 


C.CACTTATAC 


180 




AAAAAATTAA GCAGTTTAAT ATAAAAACTA AGCAATTTGC TAACGGAAAT TACGCAAGCA 240

ATGATAAAAC GTTTAATTCA CCAAAAGAAA TTTATGAATT AAATCAATCT TTTAATAAAA 300

TGGCTTCTGA AATTACGCAA CAAATGAATC AAATTAAATC CGAACAACAA GAAAAAACAG 360




AACTGATTCA 


AAACTTAGCC 


CATGATTTAA 




AGCAAGCATT 


ATTTCATATT 


420 


CTGAAGGACT ACGTGATGGT ATAATCACTA AGGATCATGA GATTAAAGAG TCATACGACA 480




TATTAATTAA 


ACAAGCAAAC 


AGATTATCAA 


^f^v x xnx x x un 


TGATATGACT 


CATATTATCA 


54 0 




CTTTAAATAC AGGTAAAACA TATCCCCCAG AATTAATAPA ACTAGACCAA TTACTTGTAT 600

CAATATTGCA ACCATATGAG CAACGTATCA AACATGAAAA CCGCACATTA GAAGTGAATT 660

TCTGTAACGA AATTGATGCA TTTTATCAAT ATCGAACGCC ACTTGAGCGT ATTTTAACAA 720

ACTTACTTGA TAATGCGCTA AAATTTTCAA ATGTTGGTAG TCGCATTGAT ATTAATATTA 780

GTGAAAACGA AGATCAAGAT ACTATCGACA TTGCTATTAG CGATGAAGGT ATTGGCATTA 840

TACCAGAACT ACAAGAACGT ATATTCGAAC GTACATTCAG AGTAGAAAAC TCTCGTAATA 900

CAAAAACGGG TGGTTCTGGA TTAGGCTTAT ATATAGCTAA TGAACTCGCG CAACAAAATA 960

ACGCAAAAAT CAGTGTAAGC AGTGATATAG ATGTAGGAAC TACGATGACT GTAACATTAC 1020

ACAAATTAGA CATTACGTCA TAATCCGATT TATTTATAAA ATAAAATGCA AAGACTAAAA 1080

AGAAGCTCCC ATTAATGAGG GCTTCTTTTT TTGTTTATTT AGAATAAACT TTATGGGTAT 1140

CCTTCTCATC ATTTTCAAGA CTTGAAAGAT TTGTAGCTTG AATAATATAT TTAGGACGTG 1200

CCTTAACTTC ATAATATATC CTGCCAATAT ATTCACCTAC AACACCAATT GAAATTAACT 1260

GTATGCCGCC TAATAATAAA ATAGCTGCAA TCGTTGAAAA ATATCCCGGA ATATTAACAC 1320

CAGATATCAT AATATTGATG GAATAAATAG A 1351




(2) INFORMATION FOR SEQ ID NO: 524: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 433 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524: 

TTGTTGTCAG TTTAACGCAA CGTGTTACCG ACTTTTCAAG TACACAAcAA TATATATGTC 60

TATTCCATTT TTAGCCCcTG CCATTTTCAT CATTGGTGGT ATTATGTTGT TTATTTCAAC 120 

ATTTAATAGC TTAGATGAAA CTGCTGAAAA TAATAACAAA ATAAAOAAAC TAATGATTAA 180 

AGGACTTATC ATTATTAACA TTTCATTTAT CGTTATGATG GTTTTAACAC CATATTGGTA 240

CTTGTATTTA ATCGTCTATC TTATTTTCTT GTTGTTTTTA TTGTGGCAAA AGGTTTATAA 300 

ATTTTAATAC CAAAACTATT AAACACTTCT GATATTCTTA GTTCAAAATA TCAGAAGTGT 360 

TTTATAGTGT TATCTAGTTC AGATAAATAT TTCCTTACTT AAAAAAACGC CCTCCTCTTA 420

TTTTGACCCC nAT 433 



(2) INFORMATION FOR SEQ ID NO: 525:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1845 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525:


CCCCCACTAT GATATGCTTG GCCTATTGCG TCAATCCCCT TATATTTCGG CAATAGAGAT 60

GGATGTATAT TCAATATTTT ACCTTCAAAT GAAGCTAATA AGTCTGGACC AATTAGACGC 120

ATGTAGCCAG CTAGAATAAT CCACTCTACC TTATCTTCAT TTAATAATGT TACTAAATGT 180

TGTTCATACG CTGCTTTTGA ATCAAATTGT TTTGGTTCAT TAATATAAAC AGGAATATCG 240

TGCTTTTTTG CTCTATCTAT ACAAAACGCA TTTTGATGAT CCGTATATAG CGCCGTAACT 300

TCAATATTTT CAAGTTTTCC TGATTCAACA TGCTCAACTA TATTTkCAAA GTTACTTCCT 360

GAACCTGATG CAAAAATCGC AATTTTAACC ATTGTTATAC CCCCAACAAT TCAATTGCAG 420

TTGACTCATT TTTCACAATA TGACCAATTT GATAGGCTTC CACATTTTGT TCTGCTAAAA 480

TCTTCAAAGC GCGTGATACA TCTTTTTCAT CAACGATAAC CGTATAGCCA ATACCCATAT 540

TAAAAATGTT ATACATTTCA TTTGTGTCTA TATTGCCTTG TTGTTGTAAC CAATCAAATA 600

TTTTTGGTGT TGGAAATGAT GTAGTATCAA TTCTAGCAGC ATATCCGGCT GGCAATGCAC 660

GTGGAATATT TTCATAAAAA CCTCCACCAG TAATATGATT CATTGCCTTA ATAGAAACTT 720

c-itttttcaa AGCAAGTACA GGTTTGACAT ATAATTTAGT TGGTTCTAAA AAGACATCGA 780
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

OCACTAAACT GTATCCATTT GAATGAATGC CACTTGACGC AAGCCCTATA ACAACTTGTC 900

CCTCTTTCAC TTCTGAACCA TCTACATAGT CATCCTTTTC AACTGCTCCA ACAGCAAATC 960

CAGCTACATC ATATTCGCCT TCGTGATACA TTTCACCCAT TTCAGCAGTC TCTCCACCGA 1020

TAAGTGCAGT ATTCGTTTCA ACACATGCAT CACTAATACC TTTAACAATT TGTTCAATAA 1080

CTTCAGGAAC AACTTTGTTT GTAGCAATGT AATCTAAAAA ATATAATGGT TCTGCACCTG 1140

TCGTTAAAAT ATCATTAACA CACATTGCGA CTGCATCGAT ACCTATCGAA TCATGTTTAC 1200

CATAGTCGAT AGCTAGTTTT AATTTCGTAC CTACTCCGTC TGTTCCAGAA ACTAAAACTG 1260

GCGCTGtCAT ATTTAATTGT GATAAATCAA ATGTAGCACC GAATCCACCT AAACCACCGA 1320

TAACTTCTTT ACGCATCGTA CGtTTAACaw GaCtAGaCaT TCtTTCtACA GCTTCATAAC 1380

CAGCATGAAT ATTTACACCA GATTGTTCAT ATGCTTTAGA CATTTAAATT TCCCTCGCTA 1440

TCAAAAAAGT GTTTGTTTTT AGAAATATAT TGCTTTTGTC GATGACTTAA ATGCGCTTTA 1500

TAATTTGCTT CATAATCATA TAACCCTGCA GGATAATCTC CAviTGAAAL. JL TTCTACACAT 1560

AAGCCACTAT ATGGCGCGTC ATAATCTAAA CCAATTGATT CAATTAACCC ATCTACAGAT 1620

AGATATGCTA ATGAATCAGC GCCAATATAA TCTTTAATTT CTTCAGGTGA TTTGCTTGCA 1680

GAAATTAATT CTGCCGTAGT TGAAACATCG ATTCCGTAAA AACTTGGAAA CATAAATTCC 1740

GGTGATGCTA TACGCACATG CACTTkATTT GCACCAGAAT CkTTTAACAT TTTCACAATG 1800

CGTCGAATTG TCGTACCGCG nACAATGGAA TCATCAACAA GACTG 1845



(2) INFORMATION FOR SEQ ID NO: 526:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1141 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526:



TCATAAATnG TAAATGTTAA TCGTCATATA ATATTAATTT ACAACACCAT TTTGGTTATT 60

TGAAGCTTGT GGCGCTTGTT GTGTGCCACC TTGATTTTGA TTTGAGTTTT GATCTGTAGC 120

AGGTTGTTGT TGATTGCTGG AmtCACTGTT ATTAGTTGAm TCACTGTTCT CGTTAGATGT 180

CTTATCTTTA TCTGTCGTAT CATTTTGTTC TTTTCTCAAT AAACTATTAT CTAAAGGCGT 240

TAATGGTATT AATGAACCAT AATGATTAAT GACACGTTGA TCTAAGAAAT CATTTTTATC 300

ATTAATAGGT GATAATTCTA AGTCTTTACG AAGTAAGTTT GCATATTTTT GAATGCTTTC 360
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5 



10 



IS 



20 



#iurx x 1 x#w%x X 


X LAAvAA aJAT 


TTGTTaAGTA 


CATTTTTOCT AAAGCTTTAA 


TTTCAGAGTT 


480 


ittil XAAA1 XA 


luL X X luUil 


TTTTACCTAC 


AATTTGAATC 


ACGTTATCAA 


GTTTATCAAT 


54 0 


AuiAAXa_AAI»X 


TCCTGTGCTT 


X x XTXsAATAA 


AATCTTAATT AATTCCATTT 


GACGTTGTCC 


€00 


ACGTTTTAAG 


X U luAATui I 


GATGTCTAGT 


TCTAGCAACT 


GCTAAAGCCT 


CATCACCATT 


660 


TAATTTTTGG TACCCTTTTT TAATTTTAAT CTTACCAGTA TCATCTGTGT TAGGTTCATT 720

TAAGTCGTAT GGCACATCAT AGTATATGCC ACCAAGCTCA TTTACAGCCT CGACAAATGC 780

TTTCATATTG ACTCTCACAT AATAATCAAC AGGTACATTC ATGGTAGCTT CTACCGAATC 840

CATTGCGGCA ATTGGACCAC CATATOCATG TGCATGGGTA ATCTTATOGT AATAGCCAAC 900

TTTAGGAATG TAQCTQATAG TATCACGTGG AATACTAAGC ATTCTAATTT GATGTTTTGA 960

TTGATTAAAA GTAGTTAAAA TCATAGCGTC TGATCTAGAG TGTTCAGCAT CCTGTCCTTT 1020

TTTTCTTCTT CCATCGTTAT CATCGATACC TAAGAAAAGA ATAGAGATAG GTTGTTCTTC 1080

GGGATGACTT TATTATCTCT TAAGTTGGAT TGACGTTAGC ATTTTTGCTG TCTTGAGAAG 1140



A 1141 
(2) INFORMATION FOR SEQ ID NO: 527:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1565 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527:

GCACCATTAG CATTGGGCnC ACTGATTGGA GTAnCAGTTG TTGAAAATTC GGCGCCAACA 60 

AGTAAACmGG CtaCaGGCaGC mATamCCCCa TATTATAcTT ATAATGGTTA TATTGGTAAT 120 

AATGCTAATT TTATTCTGGA TAAGAATTTT ATTAACGCGA TTAAGTATGA TAATGTGAAA 180

TTCAATGGTA TTAAATTAGC TAAAACGAAT ACGATAAAAA AAGTAGAGAA ATATGATCAA 240

ACTTTTAAAG GTGTTTCTGC AAAAGGAAAC GAAGCAAGTC AATTGCAATT TGTAGTTAAA 300 

AATAATATTT CATTAAAAGA TATCCAAAAA GCTTATGGCA AGGACTTGAA AAAAGAAAAT 360

GGTAAAACAA AGGAAGCTGA TAGCGGTATT TTTTACTATC AAAATGCTAA AAAGACATTA 420 

GGCATCTGGT TTGTCGTTGA TCATAATAGA GTTGTCGAAG TAACAGTTGG ACATACACCA 480 

TACAAAACAA GTAAATAAAA TAATGGCATA TTAAGGCTAG AGTGTGAGGA GTGATACCGC 540

ATTCTAGTCT TTTTTATTAA ATAATAACGA TTATTGCGTC TTACATAGTT GTTTGAAATT 600 
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GATTAAGTAT ATAGAGCACT ATTTTGTATT TGTTAATATT TTCACAAAAA TAAAGCCTTG 720

ATAAATTTTA AAATATAATT AAGCTCAATT TTTAAAATTT TATTTAGCTA CAGATAACAT 780

TTTTAAAAAA GAAAAGAATC AATAAATAAA ATCAACGAAC AAAAAGTATA GAAATAAATA 840

GAAATAATCA TTTACTTTTC TGAAAAATTA AATTAATATT TTATTTATAA GCTGTTTTTA 900

AGATTTCAGG AGGAATGAAA TGTGaGGAAA TTTTCAAGAT ATGCATTTAC AAGTATGGCA 960

GCATTAACCT TGTTOAGCAC TTTATCACCA GCAGCATTAG CGATTGATTC AAAAAATAAA 1020

CCAGCTAATT CTGATATTAA ATTTGAGGTG ACTCAAAAGA GTGATGCGGT CAAAGCATTA 1080

AAAGAATTGC CTAAATCCGA AAATGTAAAA AATATTTATC AAGATTACGC TGTTACTGAT 1140

GTAAAAACTG ATAAAAAAGG ATTTACGCAT TATACATTGC AACCGAGTGT TGATGGTGTT 1200

CATGCACCTG aCAAAGAAGT GAAAGTACAC GCAGACAAAT CAGGAAAAGT CGTTTTAATC 1260

AATGGGGATA CTGATGCGAA GAAAGTAAAG CGAACGAATA AAGTGACATT AAGTAAAGAT 1320

GACGCAGCCG ACAAAGCATT TAAAGCAGTT AAGATTGATA AGAATAAAGC GAAAAATCTT 1380

aAAGATAAAG TCATTAAAGA rAACAAAGTT GAAATCGATG GTGACAGTAA TAAATACGTT 1440

TATAATGTTG AGTTAATTAC AGTGACACCA GAAATTTCAC ATTGGAAAGT TAAAATTGAT 1500

GCTCAAACTG GCGAAATTTT AGAAAAAATG AACTTAGTTA AAGAAGCTGC AGAAACTGGT 1560



AAAGG 1565 

(2) INFORMATION FOR SEQ ID NO: 528:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2870 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 528:



GGATAGGTAA ATCCATTCAA AATTGGTTGT ACTCATTTTA AATCATATTT TTTAATATAG 60

AAACACATAA TTAAAGGAGT GATGATATTA ATGAATAAAC TTCGAGACAC TACTTTTCTA 120

TCATATTTAT TTACTATCAT ATTGTGGGGA TCTGCATTTC CAATGATAAA GATTGCGTTA 180

AATGATTTTA GTGCAGAATC ATTGTCGGCA TTTCGTTTAA TTTTAGCAAC AATAATTCTC 240

TTGCCGTTCG TAATTATAAA GAAATTGCCT ACCCCTGAAC TAAGAGATAT CCCTGTTATT 300

TTTATTTTAG GATTTTGTGG ATTTGTGATA TATCACACAG CTTTAAATTT TGGTGAAaCT 360

TTGATTAGTG CAGGTATAtC TGGrATTCyA GTCTCTACAA CGCCTATTTT TTCTAGTGCT 420
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GCA2TTATAG GAATATCCAT TATTTCAATA AGTAAAGATG ATTACACAAC TATTAATGTA 540 

TTAGGTGTTT TTATTATTTT ACTTGCATCT TTTAGTGAAA GTTTGTATTT TACTTTCCAG 600 

AAAAAATACA TAGAAAAATA CGGCTTCATC GCTTTCACAC TATaTACAAT AATGGCAAGC 660

TCACCATTTA TGCTTATTTT TATTCCTGAA ATCATCAACG ATATACACGG CGCCACTTTT 720 

ACATCAATAG TATCGGTACT TTATTTAGCT ATATTCCCTA CTATAATTCC ATACGTTTTG 780 

CTTGCTTATA TTGTGAAGTC AGTTGGTGTC TCTGATGCAA CAATGTCTCT TTATTTAACA 840 

CCTATCGTTT CTTTATTATT ATCTTATCTG TTATTAGACG AGCTACCAAC AACCCTTGCT 900 

ATTATAGGCG GAATTATCAC TCTACTAGGC GTTAGTTTAA GTAACTTCTT TCAAAATACA 960

TAATTATTCC AAGTCCCGCA CCTCAGAATC CAAAAACATT CGAGTGATAA AATTTTAAAA 1020 

ATCAAAAATA TAAAAATGAT CTAATTTCGT CAAATTTACC AATATAAATA CTAATATTTG 1080 

CAATTCACAA AGGGGTATAG TCTGAGTGTA TTCTAATACG AAAGGACTTG GTGGATATGT 1140

ATTACAGTTA TGGAAATTAT GAAGCATTTG CGCGCCCTAA AAAACCTGAA AATGTAGAAA 1200

ACAAATCCGC TTACTTAATC GGATCTGGTC TAGCTTCACT TGCTGCAGCT TGTTTTTTAA 1260 

TAAGGGATGG TCAAATGGAA GGTTCGAAGA TTCATATTTT AGAAGAGTTA CCTAAAGCAG 1320 

GTGGTAGTCT TGATGGTGAA AATATGCCTT TAAAAGGCTA TGTTGTCCGC GGTGGTCGTG 1380 

AAATGGAGAA CCACTTTGAA TGTTTGTGGG ACTTATTCAG ATCTATCCCT TCATTAGAAA 1440 

TCGATAACGC GTCTGTATTA GATGAGTTCT ATTGGCTAAA CAAAGAAGAC CCTAACTATT 1500 

CTCGCTGTCG TGTTATTGAG AAACAGGGTC AACGTTTAGT CACAGACGGA GACTTCACTT 1560 

TGACTAAAAC GGCGATTAAA GAAATTTTAG ATTTATGCTT AACGAATGAA GAAGATTTAG 1620

ATGATGTCAA AATAACAGAT GTATTTTCCG ATGACTTCTT TAATTCAAAC TTTTGGATTT 1680

ACTGGAAAAC GATGTTTGCA TTTGAACCGT GGCATTCTGC AATGGAAATG CGTCGCTATC 1740

TAATGCGATT CGTTCATCAT ATTAGTGGTC TCGCAGACTT TTCAGCTTTA AAATTCACTA 1800

AATATAATCA ATATGAATCT TTAGTATTAC CTATGGTTGA ATATTTAAAA TCGCATGGGG 1860 

TTCAATTTGA ATACGATGTA AAAGTCGAAG ATATTAAAAT AGATGTTACG ACAAGTCAAA 1920 

AAATTGCCCG AGAAATATTA ATTGACCGTA ATGGTAATGC AGAATCTATT AAACTGACTA 1980 

TAAACGATCT TGTCTTTGTG ACAAACGGTA GTATTACAGA AAGCTCTACT TATGGTGATA 2040 

ATGATACACC AGCGCCACCA ACTGACGAAT TAGGTGGTAG TTGGACACTA TGGAAAAATT 2100 

TAGCGCGACA AAGTCCTGAA TTTGGTAATC CTGATAAGTT TTGCCAAAAT ATTCCTAAAA 2160

AAAGTTGGTT TGTTTCAGCA ACTTCTACAA CAAACAATAA AGAGATTATC GATACAATAG 2220 
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

TCAATGATTC TGCATGGCAA ATGAGTTTTA CAATCAATCG TCAGCAACAG TTTAAAGACC 2340

AACCTGAAAA TGAAATATCT ACATGGATTT ATGCCTTATA TTCAGATGTA AACGGCOATT 2400

ATATTAAAAA GCCAATTACA GAATGTAGTG GTAATGAAAT ATGCCAAGAA TGGCTGTATC 2460

ACTTAGGTGT ATCAACTGAC AAAATTGAAG ACTTAGCAAA ACATGCATCT AATACGATTC 2520

CTGTTTATAT GCCATATATC ACATCTTATT TCATGACGCG TGCTATCGGC GACAGACCTT 2580

TAGTOGTCCC GCATCAATCT CAGAACTTAG CATTTATTGG TAACTTTGCA GAAACAGAGC 2640

GAGACACTGT ATTTACAACA GAATATTCGG TTCGTACTGC CATGGAAGCT GTTTATCAAT 2700

TACTAAATAT AGATCGTGGT ATTCCAGAAG TCATCAATAG TCCATTTGAT CTTCGCGTCT 2760

TAATGGATGC CATATACGAA CTGAATGACC ACCAAGATTT GCGTGAGATT ACTAAAGATT 2820

CGAAAATGCA AAAACTCGCA TTAGCAGGAT TCCTTAAAAA GATAAAAGGT 2870


(2) INFORMATION FOR SEQ ID NO: 529:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3105 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 529:






CnGTTTAGCT GCAAAAAGTA AATAACGACA CTGTATTTGG AATTTTGCAA TTAGAAACAC 60

TTTTGGGTGA CATTAACTCA ATTTTCAGCG AGATTGAAAG CGAATACAAA ATGTCTAGAG 120

AAGAAATTTT AATTTTACTA ACTTTATGGC AAAAAGGTTC TATGACGCTT AAAGAAATGG 180

ACAGATTTGT TGAAGTTAAA CCGTATAAGC GTACGAGAAC GTATAATAAT TTAGTTGAAT 240

TAGAATGGAT TTACAAAGAG CGTCCTGTTG ACGATGAAAG AACAGTTATT ATTCATTTCA 300

ATGAAAAGTT ACAACAAGAG AAAGTAGAGT TGTTGAATTT CATCAGTGAT GCGATTGCAA 360

GTAGAGCAAC AGCAATGCAA AATAGTTTAA ACGCAATTAT TGCTGTGTAA GTTTAATAGC 420

ATAAAAAGAG GTTTTCATTA AGTTGAAAAC CTCTTTTTGT TGTTGGCATT AATTTTTCAA 480

ATGTTGACTA CTCAATCCTA AATTATAAAT AGTATAGCGC aCAAaTGCTT AAGAAATTTT 540

TTCTATGGCA CAAATGAATG GAGCATGATT ACGTTGGTTT AAAAATTGAT ATTGCAAAAC 600

TTGCGCATGC TTTTGATCCA AAGTACTCAA GTAATCAAGC AATGCATGCT TCTCAATTTG 660

TCCTTCGCTA TGACCATGAT ATATAACAAG TACAATAATA CCTTCAATTG ACATTAATGA 720

TAGCAATGAA TTAATAGCTT GGATTGTCGT GTCAGGCTTT GTCACGATAG ATTTATCACC 780
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TATATGATGT TCAATATTTT CATGTCCATC TTTTATTAAA GAAACATGAT TGAAATCCTT 900

AACTTTATCA CGTGTATTTT CCAAAGCTAA ATCTTGAATG TCGAAACCAT AAACATGTCC 960

TTCTGGTACT TGTTCGGCTA AAAATAAAOT GTCATTGCOG TTACCGCAAG TTGCGTCTAC 1020

AACAATACTT TCTGGTGTTA TATGTTGTTT AATAAGTGTT TTTGAAAAAG GGAGTATACG 1080

TTCTAATTTC ATTGCTTCAC CTTAGACTTG TAACGCAAGC CTTGATAAGA ATTTCTACGT 1140

GCTAATTCAG CATCGATGCC ATTTAATACT TCCCATTTAT TAACACTCCA CATTGGACCT 1200

ACCATGATAT CTATTGGACC ATCACCGGTA ATTCGGTGAA CGATCATTTC AGGGGGAATC 1260

ACTTCTAATT GGTCCACAAC TAGGTTTGTG TACTCTTCTT GAGTCATAAA AGTTAATAAA 1320

CCTTTATCGT ATTGTTTTAC CATCGGTGTA CCTTTTAACA AATGAAGTAA ATGAATTTTA 1380

ATACCTTGTA CATCCATTTG TGCAcTTCTT TGGCAGTAGC CATCATCATG TCATAGTCTT 1440

CGCCAGGTAA GCCATTAATG ATGTGTGTAC ATACATTGAT ATTATGCTTA CGTAATTTTG 1500

CCACACCATC ATAATAAGTT TTCATATCAT GGGCACGATT GATTAAATCA GATGTTGACT 1560

GATGGATTGT TTGTAGTCCT AATTCAACCC ATAAGTATGT TCGTTGATTC AAATCTGCTA 1620

AATATTCGAC AACATCGTCT GGTAGACAGT CAGGACGCGT ACCAATAGAT AATCCCACAA 1680

CACCCGGTTC TTTAAGTACA GGTTCGAATT TTTCTTTTAA TACTTCAACC GGTGCATGTC 1740

TATTTGTAAA TGCCTGAAAA TAAGCAATAT ATTTTCCTTC GTGCCATTTC TCATGCATCT 1800

TTTCTTTAAT TTCTTTAAAT TGTACTGCGA TTGAATCTGC ACGATTACCT GCAAAGTCTC 1860

CGCTACCTGC AGCAGAACAA AATGTACATC CACCATGTGC TACAGTGCCA TCGCGGTTAG 1920

GACAGTCAAA CCCGCCATCC AATGCAACTT TAAATATTTT TTGTCCAAAT TTATTTTTTA 1980

AATGGTAATT CCATGTGTGA TAACGTTTGT TTTCAAAAGC GTATTGGAAA TGATTGCCCA 2040

TATGTCATTT TCCTTTCTAT AAAAAAAGAG TTCTAAGTAC AGATTTTAAC ATATTTTAAT 2100

GTTATAGTGT TTATTATAGT TTGACAAAAA AGAGAGAGGA ACTATGAAAT ATGAATATAC 2160

CTAAATCAGT CTGGTGGCTA GTAATTGGCA TGGCGTTAAA TATTACTGGT TCCAGTTTTT 2220

TGTGGCCTTT AAATACAATT TATATGAAAC AAGAACTTGG AAAAAGTTTA ACTGTTGCTG 2280

GTTTAGTGCT AATGATAAAT TCATTTGGCA TGGTTATTGG AAACTTATTA GGTGGTTCAC 2340

TATTTGATAA ATTAGGTGGA TACAAGACGA TTTTAATTGG AACTTTCACT TGTCTTTGTA 2400

GTACAACGCT ACTTAATTTC TTTCACGGGT GGCCTTGGTA TGCTGTATGG CTTGTAATGT 2460

TAGGGTTTGG TGGCGGAATG ATTATTCCTG CGATATACGC TATGGCTGGA GCAGTGTGGC 2520

CAAATGGCGG AAGACAAACG TTTAATGCGA TATACTTAGC GCAAAATATT GGTGTGGCTG 2580
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ATCTTATTAT GTATGTTGTG TTTGCGCTTG TOGCGGTAAC GCAATTTAAT ATTGAAATTA 2700

ATGCGAAAGT TAAATATCCA ACTCATTTAG ATATTACTGG TAAAAAGAAT AAAGCAAGAT 2760 

TTATTTCATT AGTACTAATT TGTGCAATGT TTGCAATTTG TTGGGTTGCA TATATTCAAT 2820

GGGAGTCTAC AATCGCTTCA TTTACACAAT CTATTAATAT TTCAATGGCA CAATATAGTG 2880 

TTTTATGGAC AATTAACGGA ATAATGATTT TAGTAGCACA ACCATTAATT AAACCGATTC 2940 

TCTATCTGTT AAAAGGAAAC TTAAAGAAGC AAATGTTTGT CGGCATCATC ATTTTTATGT 3000

TGTCGTTCTT TGTCACGAGT TTTGCCGAAA ACTTTACAAT ATTTGyTGTC GGTATGATTA 3060 

TTTTAACTTT TGGAGaATGT TTGTATGGCC AGCAGTTCCA ACTAT 3105

(2) INFORMATION FOR SEQ ID NO: 530: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5532 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530:



TAATGATTAA ACCAGATGCA GTACAAAGAA ATCTAATTGG TGAAGTAATT TCAAGAATTG 60

AAAGAAAAGG ACTAAAACTT GTCGGTGGTA AATTAATGCA AGTACCAATG GAACTTGCTG 120

AAACACATTA TGGTGAACAC CAAGGTAAAC CATTTTATAA TGATTTAATT TCATTTATTA 180

CATCAGCACC AGTGTTCGCA ATGGTAGTTG AAGGTGAAGA TGCAGTTAAT GTATCTAGAC 240

ATATTATTGG CAGCACCAAT CCTTCAGAAG CTTCACCAGG ATCAATTAGA GGTGATTTAG 300

GTTTAACTGT TGGTAGAAAT ATCATTCACG GTTCAGATTC ATTAGAGTCT GCTGAACGTG 360

AAATTAATCT ATGGTTTAAT GAAAATGAAA TTACTAGCTA TGCTTCACCA CGTGATGCAT 420

GGTTATATGA ATAAAATATA AACTGTAAAC CTTTACGATT TATTTATAAA GGTAGAAAGG 480

GTTTTGTTAT GTGGTTAGTC ATTATGaTTA TACATAACAA GGCCCGTTTT TTATGTTGTA 540

GfAAATTACT TGAAAAATTT TATAGTTTTT kGGTAAcACG TAtTaAAAaG AGAGGAATAT 600

TCTTTATCAA ATGAAACTAA ACAGAGAGAA GGGGTTGTTA AAATGAAgAA TATTATTTCG 660

ATTATTTkGG GGATTTTAAT GTTCTTAAAA TTAATGGAAT TACTATATGG TGCTATATTT 720

TTAGATAAAC CACTTAATCC TATAACAAAA ATTATTTTTA TACTGACTCT CATTTATATT 780

TTTTATGTAT TAGTAAAAGA ATTGATTATA TTTTTGaAGT CAAAGTATAA CAAAAGCGCT 840

TAACATATGT ATATTTTAAT ATCATAATTT TTTTAAACGG ACTGATTAAC TTTATTAATA 900
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GATACGATTA TATTAAAACG GCTAATCATT TTTAATTAAT GATTATATGA TGCAACTGTT 1020

TAGAAATTCA TGATACTTTT CTACAGACGA ATATATTATA ATTAATTTTA GTTCGTTTAA 1080

TATTAAGATA ATTCTGACAT TTAAAATGAG ATGTCATCCA TTTTCTTAAT TGAGCTTGAA 1140

AACAAACATT TATGAATGCA CAATGAATAT GATAAGATTA ACAACATATT ATAATGTTAT 1200

CGTGGAAGTA TGAAAGGAGC GAGTGTGTAT GAGATACCTA ACATCAGGAG AATCACATGG 1260

ACCTCAATTA ACAGTTATTG TTGAAGGTGT ACCTGCAAAT ATAGAAATTA AGGTTGAGGA 1320

TATTAATAAA GAAATGTTTA AGCGTCAAGG CGGTTACGGA CGTGGACGTC GTATGCAAAT 1380

TGAGAAAGAT ACAGTAGAAA TAGTATCAGG CGTTAGAAAT GGTTATACAT TAGGTAGTCC 1440

AATTACTATG GTTGTAACCA ATGATGACTT TACGCATTGG AGAAAAATTA TGGOAGGAGC 1500

TCCAATAAGT GAAGAAGAAC GTGAAAATAT GAAACGTACT ATTACAAAAC CAAGACCTGG 1560

TCATGCAGAT TTGGTTGGAG GTATGAAATA TAATCATCGT GATTTACGAA ATGTGCTAGA 1620

GCGATCATCT GCTAGAGAAA CAGCAGCTCG AGTTGCAGTC GGTGCCTTAT GTAAAGTGTT 1680

ATTACAACAG TTAGATATCG ATATATACAG TCGTGTTGTT GAAATAGGTG GAATTAAAGA 1740

TAAAGATTTT TATGATTCAG AAACATTTAA AGCAAATCTT GATOGTAATG ATGTTCGTGT 1800

AATTGATGAC AGTATCGCAC AAGCAATGCG AGATAAAATT GACGAaGCTA AAAATGAAGG 1860

AGATTCAATT GGCGGTGTCG TTCAAGTTGT AGTTGAAAAT ATGCCTGTTG GTGTAGGTAG 1920

TTATGTGCAT TATGATCGTA AGTTAGATGG TAAGATTGCA CAAGGTGTTG TCAGCATAAA 1980

TGCTTTTAAA GGTGTAAGCT TTGGTGAAGG ATTTAAAGCA GCTGAAAAGC CAGGTAGTGA 2040

GATTCAAGAT GAAATTCTAT ATAATAGTGA AATTGGTTAT TATCGTGGAT CTAATCACTT 2100

AGGTGGTTTA GAAGGCGGTA TGTCAAATGG AATGCCAATT ATCGTTAATG GTGTAATGAA 2160

ACCAATTCCA ACGTTATATA AACCATTAAA TTCAGTAGAC ATTAATACTA AAGAAGACTT 2220

TAAAGCAACA ATTGAACGTT CTGATAGTTG TGCTGTTCCT GCAGCAAGTA TCGTCTGCGA 2280

ACATGTCGTA GCATTTGAAA TAGCAAAAGC ATTATTGGAA GAATTCCAAT CAAATCATAT 2340

TGAGCAACTT AAACAACAAA TTATTGAGCG CAGACAATTA AATATTGAGT TTTAACAACA 2400

AGAACAATTG AGGTGTAATC ATGAAATTAC AAACAACATA CCCTTCAAAT AATTATCCAA 2460

TATATGTTGA ACACGGTGCA ATTGACCATA TTAGCACGTA TATTGATCAG TTTGATCAAA 2520

GTTTTATATT AATTGACGAG CATGTAAATC AATATTTTGC TGATAAATTT GATGATATTT 2580

TATCATATGA AAATGTACAT AAAGTTATTA TTCCAGCTGG TGAAAAGACG AAAACATTTG 2640

AGCAATATCA AGAAACATTA GAGTATATTT TATCCCATCA TGTAACTCGT AATACAGCAA 2700
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ACGAOGCGTG CACTTTATAC AAGTGCCAAC GACTATACTA GCGCATOATT CTAGTGTTGG 2820

CGGTAAAGTG GGTATTAACT CAAAGCAAGG TAAAAACCTT ATCGGTGCAT TTTATCGTCC 2880

AACTGCTGTG ATTTATGATT TAGTCTTTTT AAAGACGTTA CCATTTGAGC AAATATTAAG 2940

TGGCTATGCA GAAGTTTATA AGCATGCGTT ATTGAATGGT GAATCAGCGA CGCAAGATAT 3000

CGAACAGCAC TTTAAAGATA GAGAGATATT ACAGTCATTA AATGGTATGG ATAAATATAT 3060

TGCTAAAGGT ATTGAAACGA AGCTGGATAT TGTTATTGCA GATGAAAAAG AACAAGGTGT 3120

ACGTAAATTT TTAAATTTAG GTCATACATT TGGTCATGCT GTTGAATACT ATCATAAAAT 3180

ACCTCATGGT CATGCAGTGA TGGTTGGCAT TATCTATCAA TTTATAGTTG CGAATGCTTT 3240

GTTTGATTCT AAGCATGATA TTAATCATTA TATTCAATAT TTAATACAAC TCGGCTATCC 3300

TTTAGACATG ATAACTGACT TGGATTTTGA AACGTTATAC CAATATATGC TAAGTGATAA 3360

AAAGAATGAT AAGCAAGGTG TACAAATGGT CTTGATTAGA CAATTTGGAG ATATCGTTGT 3420

ACAACATGTT GATCAACTAA CATTACAACA TGCATGTGAA CAATTAAAAA CATATTTTAA 3480

GTAGGTGAAT GAAATGGTAA ATGAACAAAT CATTGATATT TCAGGTCCGT TAAAGGGCGA 3540

AATAGAAGTG CCGGGCGATA AGTCAATGAC ACACCGTGCA ATCATGTTGG CGTCGCTAGC 3600

TGAAGGTGTA TCTACTATAT ATAAGCCACT ACTTGGCGAA GATTGTCGTC GTACGATGGA 3660

CATTTTCCGA CTGTTAGGTG TAGAAATCAA AGAAGATGAT GAAAAATTAG TTGTGACTTC 3720

CCCAGGATAT CAATCTTTTA ACACGCCACA TCAAGTATTG TATACAGGTA ATTCTGGTAC 3780

GACAACACGA TTATTGGCAG GTTTGTTAAG TGGTTTAGGT ATTGAAAGTG TTTTGTCTGG 3840

CGATGTTTCA ATTGGTAAAA GGCCAATGGA TCGTGTCTTG AGACCATTGA AACTTATGGA 3900

TGCGAATATT GAAGGTATTG AAGATAATTA TACACCATTA ATTATTAAGC CATCTGTCAT 3960

AAAAGGTATA AATTATCAAA TGGAAGTTGC AAGTGCACAA GTAAAAAGTG CCATTTTATT 4020

TGCAAGTTTG TTTTCTAAGG AACCGACCAT CATTAAAGAA TTAGATGTAA GTCGAAATCA 4080

TACTGAGACG ATGTTCAAAC ATTTTAATAT TCCAATTGAA GCAGAAGGGT TATCAATTAA 4140

TACAACCCCT GAAGCAATTC GATACATTAA ACCTGCAGAT TTTCATGTTC CTGGCGATAT 4200

TTCATCTGCA GCGTTCTTTA TTGTTGCAGC ACTTATCACA CCAGGAAGTG ATGTAACAAT 4260

TCATAATGTT GGAATCAATC CAACACGTTC AGGTATTATT GATATTGTTG AAAAAATGGG 4320

CGGTAATATC CAACTTTTCA ATCAAACAAC TGGTGCTGAA CCTACTGCTT CTATTCGTAT 4380

TCAATACACA CCAATGCTTC AACCAATAAC AATCGAAGGA GAATTAGTTC CAAAAGCAAT 4440

TGATGAACTG CCTGTAATAG CATTACTTTG TACACAAGCA GTTGGCACGA GTACAATTAA 4500
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AAACTTGTTA GGGTTTGAAT TACAACCAAC TAATGATGGA TTGATTATTC ATCCGTCAGA 4620

ATTTAAAACA AATGCAACAG TTGATAGTTT AACTGATCAT CGAATAGGAA TGATGCTTGC 4680

AGTTGCTTCT CTACTTTCAA GCGAGCCTGT CAAAATCAAA CAATTTGATG CTGTAAATGT 4740

ATCATTTCCA GGATTTTTAC CAAAACTAAA GCTTTTAGAA AATGAGGGAT AATATAAAAT 4800

GGAAGATATC TATAAATTAA TAGACGATAT CAATCTACAA AAACTAGAAA ATTTAGACTC 4860

TCGTGTTAAT GAAGCAATAA CTACTGACAA CGATGACGCA TTATTTATTC TAGGAGAGAC 4920

ACTTTACAAT TTTGGATTAA TGCCaCAAGG TTTGGAAGTA TTCCGCGTGT TATATCACAA 4980

ATATCCAGAC GAAAOTGAAT TGCTGATTTA TTTTATTGAA GGTTTAATGT CTGAAAATCA 5040

AACTGAOGAA GCGTTAGAAT ATTTAAGTTA TGTTGAACCA TCACCTGAAA AGTTGATGTT 5100

AGAAGCAGAT TTATATCAAC AAATTAATAT GATGGAAGTT GCTATTGATA AATTACAAGA 5160

AGCACTTGAA CTAGAGCCAA ATGATCCAAT AATCCATTTT GCATTGGCTG AAATGTTATA 5220

TTATGATGGT CAATATTTAC GTGCTACCTC TGAATACGAA ACCGTTTTAG AAACTGGTGA 5280

ATATCAAGTT AATGGTGTAA ACTTATTCTC TCGTATGGCA GATTGTAGTT TACAAAGTGG 5340

kAACTATAGT GATkCcGATt CgCTTATACG ATGrAATTAA TGAAGATGAA ATGACTTCAG 5400

AAGATTATCT CAAAAGAGCC ATTTCTnACG ATAAAAATGA CATCACTCAA GAAGCAATTA 5460

AAATAATGAC TACATTACTT TCTAAAGATC CTGATTATAT TCAAgGCTAC TTGTATTTAC 5520

aATCaTTATA TG 5532

(2) INFORMATION FOR SEQ ID NO: 531:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 942 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531:

AATTGGGTTA TACTATAGGT AAATTTAAGG AGGTAAGAAA ATGGATAAAA AAGAATTAGC 60

GAAATTTATA GGCAATAAAA TCAGATACTA TAGAACCaAA TTGaACTTAA CTCAAGATCa 120

ACTTCGAGAA AAACTCmACa CTAAAAArGC TACTATTTCA AATTATGAGA CAGGGTACAG 180

AACTCCTAAA CAAGATGATT TGTTTGAAAT TGCTCATATT TTAAATATCA GTATCGATGA 240

TTTGTTTCCT ACAAGAAATA ATAAAAAAAA CGACATCACT TCCATATACA ACAAACTCAC 300

ACCTCCCCGC CAAGAAAACG TACTTAACTA CGCAAATGAG CAATTAGATG AACAGAATAA 360
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AACTGGTGCT GGCATAGGAG AAGAATTATA TGATGACATA TTOCATGAAG AAGTATTTTT 480 

TAAAGAAGAC GAAACGCCAT CAAATGCTGA TTTTTGTATT TTAGTTAATQ GTGATTCAAT 540 

GGAACCTATG TTAAAACAAG GAACATACGC TTTTATTAAG AAAGAAGATT CTATTAAAGA 600 

TGGTACAATT GCACTCGTTG TATTAGATGG AGTAAGTCTT ATCAAGCGTG TAGATATATG 660

CGAAGACTAT ATTAATTTGG TATCTCTAAA TCCGAAGTAT GATGATATCA AAGTCGCTTC 720 

GTTTAGTAAT ATTAAAGTAA TGGGCAAAGT TGTATTGTGA TTAATAGCGC CTATATGGCA 780 

CTTTAATATA AAAGACGTCT ATTTCAGCAG TGTTTAAAAG GAGTTTATAA TGAAAATAAC 840 

TAATTGCAAA ATAAAAAAAG AAACTATAGT ATATGAAGTT TTAACTAGTG GTAATCAACC 900 

ATTCACTTAT GAGTTACCTA AAGATTTATC GTCACATAAT GC 942 
(2) INFORMATION FOR SEQ ID NO: 532: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 417 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532: 

TGGnAAATGC AAACCAAAAT ATGaTCCTCG TGcAAGTTGA AGCGGGACGT TATGAAGAAT 60

GGGTAAAGAA TGGTTATTTT AAACCGTCAG AAGATAAATC AAAAGAAACA TATACAATTG 120

TTATCCCGCC ACCAAATGTA ACTGGTAAAT TACATTTAGG ACATGCATGG GATACGACTT 180

TACAAGATAT CATTACACGT ATGAAACGTA TGCAAGGATA CGATACGTTA TACTTACCAG 240

GTATGGATCA TGCTGGTATT GCGACACAGG CAAAGGTAGA AGCTAAATTA AATGAACAAG 300

GAATAACTAG ATATGATCTT GGTCGTGAAA AGTTTTTAGA ACAGGCATGG GATTGGAAAG 360

AAGAGTATGC GTCATTTATT CGTGCGCAAT GGGCTAAATT AGGTCTAGGT TTAGATT 417

(2) INFORMATION FOR SEQ ID NO: 533: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 733 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 533:
GATCCTGAAC CCGCATTTGT TTCCACTAAA ACAGTATGCC CACTTTCTAC TAAAGCGTGC 60 
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



ATTTTCATAC CATCCACCTC CATAATCATC TTAACGCGAA CATTTTGAAA GCGCAATCAA 180 

AAATCCACAA AATTGTAAAG GTTATTACAC TGACTTTTCC GAAAATTGTG GTAAAATATA 240 

ATTAAGAAAG AACAAGGAGG CACTTACTAT GATTACTTAC AAAAATATTT TAATCGCAGT 300 

TGACGGTTCA CATGAAGCGG AATGGGCATT TAACAGAGCA GTTGGTGTTG CTAAACGTAA 360 

CGATGCGAAG TTAACAATTG TGAATGTAAT TGATTCAAGA ACGTATTCTT CTTATGAAGT 420 

TTATGATGCT CAATTTACTG AAAAATCTAA GCATTTTGCA GAAGAATTAT TAAATGGTTA 480 

TAAAGAAGTA GCTACTAACG CTGGTGTTAA AGATGTAGAA ACGCGTCTAG AGTTTGGyTC 540 

TCyTAAATCT ATCATTCCTA AAAAGCTTGC ACATGAAATT AATGCAGACT TGATTATGAG 600

TGGTACATCA GGCTTAAATG CCGTGGAAAg ATTTATTGTT GGTTCTGTAT CAGAATCTAT 660 

CGTTCGTCAT GCGCCATGTG ACGTGTTAGT TGTTCGTACT GAAGAGTTAC CAGCAGACTT 720 

CCAACCACAA GTT 733

(2) INFORMATION FOR SEQ ID NO: 534: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6060 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 534: 

TGATGATCCT GAAGCGCTAT TGGATAATTA CAACACTGAA GATGTTGATG CACACAATTA 60 

CAATAATATA AATCATGTTA TTTTTGCCTG CGATGCGGGT ATGGGTTCTA GTGCAATGGG 120 

TGCArGCATG TTACGTAATA AATTTAAAAA GGCGGGCATT AATGATATTA CAGTTACAAA 180 

TACTGCGATT AATCAATTGC CAAAAGATGC TCAATTAGTT ATCACTCAGA AAAAACTAAC 240 

TGATCGTGCT ATTAAACAAA CACCAAATGC CATCCATATT TCAGTGGATA ATTTCCTTAA 300

TTCACCAAGA TATGAAGAAC TTTTAAATAA TCTAAAAAAA GATGATCAAG CATAATAATT 360 

AAATAAATTA AAAAATGGAG GATACCGCCA TGTTATTGAG TACACGTGAA AAAGAAATGA 420 

TAGCCCtAtT GATTAAGTAC CACGGtCAAt ATATCACTAT ACACGACATT GCTCAGCAAC 480 

TTGCGGTGTC CTCTCGTACT ATTCACCGTG AATTAAAAGG TGTTGAAGCA TATTTAACTT 540

CATTTTCATT AACTTTAGAA CGCGCAAACA AAAAAGGGcT ACGCATTGCT GGCACAGATT 600 

CTGATTTAAA CGATTTGAAG CAATCGATTG CACAACATCA AACCATTGAC TTATCTGTTG 660

AAGAGCAGAA AGTAATTATT ATATACGCTT TGATACAAGC CAAGGAGCCA GTTAAACAAT 720
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TAGAGCTTGA 


TTTAAATAAG TACCAACTAT CTTTATCTCG AAAGCGTGGC GAAGGCATTT 840

ACTTGGTAGG TACTGAATCA AAGAAACGTG AATTTTTAAG TCAATTAATG GTGAATAACT 900

TAAATAGTAC TAGCGTTTAT TCAGTAATTG AAAATCATTT TGTCTTTCAT TCATTAAATC 960

AAATCCACAA AGACTTTGTT GACTTAGAGC GCATTTTTAA TGTTGAAAGA CtATTaATGG 1020

ACTACCTAAG TGCCTTACCC TACCAACTTA CCGAATCAAG TTATTTAACT TTAACTGTCC 1080

ATATCGTGCT CTCCATTTCA CGTATAAAAA ATGGAGAGCA TGTCGCATTA AACGATGATA 1140

TTTATGATTC TGTACAAAAC ACATTTGAAC ACAAAGTaGc AAGCGAACTT GCTGATAAAC 1200

TTGGTCAAAT ATATGACGTC ACGTTTAATC AGGCAGAAAT TGCTTTCATT ACTATCCATT 1260

TACGTGGAGC TAAACGAAAA AATCTTAATG ATACATCATT AAATAATCGT TGTGAAGAAA 1320

ACAAAATTAA AGCGTTTGTT AACAAAGTAG AAATGATTTC CGGTATGACA TTTGCAGATT 1380

TGGATACTTT AGTAGATGGA CTGACGCTAC ACCTTAATCC TGCAATCAAT CGTTTGCAAG 1440

CTAATATCGA GACCTATAAT CCGTTAACAG ACATGATTAA GTTCAAATAT CCAAGACTAT 1500

TTGAAAATGT AAGATTAGCT TTAAATGATT GTTGGCCTGA TTTGATTTTT CCAGAGAGTG 1560

AAATTGCTTT TATAGTtTTA CACTTTGGTG GCTCGATTAA AAACCAAGGT AATCGATTTT 1620

TAAACATATT AGTCGTTTGC AGCAGTGGTA TGGGAACTAG TCGTCTATTA TCAACTCGTC 1680

TAGAGCAAGT TTTTAGTGAG ATTGAGCGTA TTACACAAGC ATCAGTCAGC GATTTGAAGT 1740

CACTAGATTT AAGTCAATAT GATGGCATTA TTTCTACTGT GAATTTAGAC ATCGACTCCC 1800

CCTATTTAAC GGTAAACCCA TTATTACCAG ATAGTGATAT CAGTTATGTC GCACAGTTTT 1860

TAAATACAAA GTCTACGTTC CAAGAGACGC ATGATAAATC ATCAAACATG ATTGATAAGG 1920

ATGATGTTCA TGTTGAAACG AAAGATGTTG ATGGCAACAC ATCTTTTGAA AATGAACAAA 1980

CTTCATACTT AACTTCAGTT TTCGAAAAAC ATTTAAGTGA CGAAAAATCA GAACAATTAT 2040

TGCATCATAT GCGTTCGGGT TTAACTTTGC TTGATTCAGT GAAAATAGTT AGTACCGAAG 2100

TTAAACAGTG GCAAACATAT ATCGCAGATT ATCTATATCA ATGCGATGTA ATAAACGATC 2160

CAACGTCATT CGCTGAACTA CTAGAGCAAC GATTGATTGA CAATCCAGGA TGGATATTAA 2220

GTCCATATCC TGTTGCAATA CCACACCTAA GAGACAATAT GATTAAACAC CCTATGATTC 2280

TAATCACAGT TTTAGAAGAA CCGTTAACAT TGCCTAGTAT TCAAAATGAC AATCAAACAA 2340

TTAAATATAT GATTTCCATG TTTATTTCTG ACAATGATTT TATGGCATCA CTGGTAAGTG 2400

ACTTGTCCGA ATTTTTAAGT TTGAAATTAG AATCTATTGA TACTTTTATG GAAAATCCAC 2460

AGGAACTTGA AACATTATTA AGAAACAAAT TTTTAGAACG AATTAAAAAA CAATTTATTT 2520
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TAACAGCCAA AATGAAGCAA TTGAAAAAGC AGGTAAAGCC TTAGTTGATA GTGGTGCTGT 2640 

AACAGATGCT TATATTCAAG CAATGAAAGA TCGTGAGCAA GTCGTATCAA CATTTATGGG 2700 

AAATGGCTTA GCAATTCCTC ATGGCACAGA TGAAGCTAAA ACAAATGTGA TTCACTCAGG 2760 

TTTAACATTA TTACAAATCC CTGAAGGCGT TGACTGGGAT GGCGAAGTAG TTAAAGTTGT 2820 

CGTGGGAATT GCTGGTAAAG ATGGCGAACA TTTAGACTTG TTATCTAAAA TTGCAATTAC 2880 

ATTTAGCGAA GAAGAAAATG TGGATCGTAT CGTTCAAGCA AAATCTGCAG AAGAAATTAA 2940 

ACAAGTATTC GAGGAGGCAG ATGCATAATG AAAGCAGTTC ACTTTGGTGC TGGTAACATA 3000 

GGTCGTGGTT TCATTGGTTA TATTCTgCAG ACAACAATGT TAAAGTAACA TTTGCAGACG 3060

TCAATGAAGA AATCATTAAT GCTTTAGCTC ATGATCATCA ATACGATGTT ATTTTAGCTG 3120 

ATGAGTCTAA AACAACGACG CGCGTGAAtA ATGtTGATGC AATTAATTCA ATGCAACCTT 3180 

CTGAAGCGTT GAAACAAGCA ATTCTAGAAG CTGATATTAT TACAACAGCT GTTGGTGTTA 3240

ACATACTACC TATTATTGCT AAATCTTTTG CGCCTTTCTT AAAAGAAAAA ACAAACCATG 3300 

TTAATATTGT TGCTTGTGAG AATGCTATTA TGGCAACTGA TACATTGAAA AAAGCAGTAC 3360 

TTGATATTAC TGGCCCTCTT GGTAACnaTA TTCATTTTGC TAACTCAGCA GTTGATAGAA 3420 

TTGTACCATT ACAAAAGAAT GAAAATATAT TAGAOGTTAT GGTTGAGCCA TTTTACGAAT 3480 

GGGTTGTTGA AAAAGATGCA TGGTATGGTC CAGAACTAAA CCATATTAAA TATGTTGATG 3540 

ATTTAACACC ATATATTGAG CGTAAATTAT TAACTGTGAA TACAGGACAT GCATATTTAG 3600 

CGTATgCTGG tAAATTTGCA GGTAAAGCTA CAGTTTTAGA TGCAGTTGAA GATAGTTCAA 3660 

TTGAAGCTGG CTTACGCCGT GTTTTAGCTG AAACTAGTCA ATATATTACT AATGAATTTG 3720 

ATTTTACTGA AGCGGAACAA GCTGGTTATG TTGAAAAAAT AATAGATCGT TTCAACAATT 3780 

CTTATTTATC TGATGAAGTA ACACGTGTCG GACGAGGTAC ATTACGTAAA ATTGGCCCTA 3840 

AAGATAGAAT TATAAAACCA TTAACATATC TTTATAATAA AGATTTAGAA OGCACTGGTT 3900

TATTAAATAC AGCTGCATTG TTATTGAAGT ATGATGATAC AGCAGACCAA GAAACTGTTG 3960 

AGAAAAATAA TTACATTAAA GAACACGGTT TAAAAGOGTT TTTAAGTGAA TATGCTAAAG 4020 

TTGACGATGG CTTAGCCGAT GAAATAATTG AAGCGTACAA TTCACTTTCA TAATTTATTG 4080

AGCTTTGTTT GAAACAAGAA GTTTCCAACG TTATTCGTTA ACAATCAGTA ATAATGTAGT 4140 

AGTTCCCTTG AATTAACAAT ATTAAATTTC TGAACATAAA AAATACTCCC TTCAACATAG 4200 


ACACTTAACT TGTGTTATGT ATGAAAGGAG TATTTTTGCG TTAATAATTT GTTTTATTTT 4260 

CGAGCCACAG CCACCTATTC AATGGCTATT GGTCATTACT AAAACAAATT CATATTAACT 4320 
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TTGAATAAAT TTTATTCTTC AGTTTGTTGG TCTTTCTTAG TGAATCTTCT AATTAAGAAT 4440 

GCCATACCTG CACCTAGAGC TAATTCAGCA TATGGTAAAT CGTCATTATG TGACATACCA 4500 

GTATCTGGTA AAGTTTTAGC TTGTTGTTTA GCTTTATTAA CTTTTCCTTG TTGAGCTGAT 4560 

TTTGTCTTAG CTTGGTGGTC GTCAGTGTTA GTTACATTAA GCATATCTTG ATTAGCACTA 4620 

TTGCTTCCAT TTGAAACTGT AGCTGGAGAT GCATTGGCAC CGTCGTTTTG CGTAgyTTTA 4680 

TTGTTTGCAG CTGAACCAAC TGATTTTTGC GTATCATTAG TATCTGCTGT TGCCGTATCA 4740 

TCTTTTTGGC TAACATTAGT TGAAGTCATT TTTTCTTTTG CTTCAGAAGA TGCAGATGTT 4800 

GATGGTTTAT TCGAAACTTC AGTATCAGCT TTGCTTGGCG ATTTATCTGC TTCGTTAGAT 4860

GCAACGTTAG TTTCAGACTT AAGTTGTCCT GCATCAGTTT GATTTGTCGT ACTTTCTTCT 4920 

TTATCTTTTG ATGTATTAGA AGGTACATTT GGTTCTGTTA TGTCTGCTGA AGGCAATGTT 4980 

TCAGTTGTtG ATTCAACCAT ACTTTGATTT GTTGAATCAC TACCATCTTT TTCTGCCTTA 5040

GCTTTATTTT CAGATTTTGG TTGTGCAACC TTGTCATTAG TTGATTGAGA TTCAGCACTA 5100 
TTATTTACTT CAGCATTTTG TTTTGAATCA TTTACAGATG CATTATCTTT GCTATCAGCA 5160

GATGATGCTG CTTCTGTGCT CGCAGTTGTT GGAGCGGTTG CTGTTGATCC TGTTGGTGCA 5220

TTCTCGTTTG TTGCTGTAGT TGTACTATTG TTATTTGTTG TGCTTTCTGC TGGCGTTGCA 5280 

TTATGAGTTT CTGTTACAGG TTTATCAGTT GTGCCGTTAT TAGTTGATTC TACTTCTGGT 5340

TTACTAGTTA CATCGTTATC CATTGTCGGA CTGTTTGTTG ATGCATCTAC ACTAGAATTG 5400

TTATTAGCTT GCGGTTTATC ATTTGCATCA TCAGTTGCTG ATGTTGCTGT TGTTTCACCT 5460

GTTGCCGCAT CACTATTATT TGGTGTTGTC GGAGAAGGGT CTGCTTTGCC ATTAGCTGTC 5520 

GTCTCAGATA CGTTAGGTTG TCCAGTATTT TCTGGTGTTG CATTAGCATT TGAATTTGCT 5580 

GTTGCATCAT TATTATCTAT ACCATTATTA GTATCATTAG CATCTGGATC ATTCTGAGGC 5640

ACAATCGCTT CAATTGCAGG TATCGTTACA TTTTGTAATT CAGCAACTTC TGCATTTGTT 5700

TGTGTTTTAT CTAATTTATC AGCAAATCTG TCAAAATATC TACCTAAATC CGTACGTGCA 5760 

ATTTCTTTCG CCGATGCATC TGCATCTGCA TTTTTAATTA TTTCTATTTG CTTGTTAACC 5820 

ACTTCTCTGA TTGCTTCCAA AGCATTTTTC TTAACTTCAG GATTAATACG TTGTGCTTTA 5880

AGTTGTTCAA GCGCACTATT TTTGACAGTA GCGATTTCTG CATTTGTAGT TTGATCAGAA 5940 

ATATCTTCAG TTGCTTTTGA TAAAATGTCT TCTAAAGCAT TCGTAAACGC TTCTTTTTCT 6000 

TCAGTTGTAG CATCAGCGTT GACATTTACA CCTGCTTCAA TCTGGTCTAG TGCAGTTTCT 6060 
(2) INFORMATION FOR SEQ ID NO: 535: 
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(A) LENGTH: 377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 535:



AACAAAGCCT TCCAATTATC TGCGTCGGTA GAACAAGTAT TAGCAACTTT ATCACCTACG 60

CTAAACAGTC CTTACGATTT ATACGGCAOG ACAAAAATGC TAGATATTAC ATTCGATTCA 120

TTTGAACATG ATGGTACAAC GTACCCTGTC GACTATGCTA CGTTTGAAAA TGATTATGAA 180

GATAATAAAG ATCCTGAGTT TAGAOGTAAA AGTTTCAAAT CGTTTAGCGA TGGGATTCGA 240

AAATATCAGC ATACTACCGC GGCTACATAT AATATGCAAG TACAACAAGA AAAAATTGAA 300

GCTGATTTAC GTGGATTTGA ATCAGTCATC GATTATTTAT TACATAGTCA AGAAGTAACG 360

CGTGATATGT TTGACCGTCA AATCGATATG ATTATGCGTG ACTTGGCACC AGTTATGCAG 420

AAATATGCTA AACTTTTACA ACGTATTCAC GGATTAGATA ACATGCGTTT TGAAGACTTG 480

AAGATTTCTG TAGACCCTGA TTATGAACCA GAGATTTCAA TTGAAGACTC AAAAAATTAT 540

ATTTTCGGTG CGTTAAGTGT TTTAGGTGAT GACTATACAA ACATGTTACG TGAAGCATAC 600

GATCAGCGAT GGaTTGATTT TGCACAAAAT AAAGGTAAAG ATACAGGCGC ATTTTGTGCA 660

AGTCCATACT TTACACATTC ATATGTGTTT ATTTCTTGGA CTGGTAAAAT GGCTGAAGCA 720

TTTGTCTTAG CACATGAATT AGGTCATGCA GGTCATTTTA CATTAGCTCA AAAACATCAA 780

CCATATCTTG AATCAGAAGC ATCAATGTAC TTTGTTGAAG CCCCTTCTAC AATGAATGAA 840

ATGTTGATGG CCAATTATTT ATTTAACACA AGTGATAATC CAAGATTTAA GCGTtGGGTT 900

ATTGGCTCAA TTTTATCTAG AACATATTAT CATAATATGG tACCCmTTTA TTAGAAGCnG 960

CTTATCCACG GGGAGTG 977

(2) INFORMATION FOR SEQ ID NO: 536: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1440 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 536:

AGACAGTGAT yGaATTTCAT TTACAGTACA CAAATCATCG AAAAATTGGT AACATTCTTC 60 
TCTATTTTCT AACGTTAgwA TTgCATCAAA CAATTCATCT AACGCTGCAC CTCGTAATTT 120 
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AATCTATTAA 


AGTGTAGCGa TTTATATTTT ATTAAATCTG AATCGTTACT TTATTTAATT 240

TTATGCTAAT CCAGCGCGTT CGAAAATAGT GTCAACTTGa TTCAAATGAT GTTTAGGATC 300

GAAACATTCA TCCAATTCTT CTTTTGTTAA AACACTTGTA ATAGACTCAT CTTGTTCGAT 360

TAATTCACGG AACGGTGTTT TCGTTTCCCA AGATATCATC GCTTTTGGTT GTACTTTGTC 420

GTATGCTTCT TCACGAACCA TACCTTTATT AATTAATGCT AATAAGACAC GTTGTGAGAA 480

AATCAGACCA AATGTTTTAT CTATGTTATT ACGCATATTA TCTTCAAATA CAGTTAAACG 540

GTCCACAATA TTTGTGAACG ATTCAATGCA TAATCTAGTG CtATTGTAAC ATCTGGTAAC 600

ATAATACGCT CAGCAGAAGA ATGAGAAATA TsTCTTTCAT GCCATAATGG CACATTCTCA 660

TAAGCTGTAO TAATATAACC ACGAATGACT CTTGAAATAC CTGTGATATT TTCAGAACCA 720

ATTGGATTTC GTTTATGAGG CATTGCAGAT GAACCriTTT GGCCTTTTGC AAATGCTTCT 780

TCAACTTCTC TCGTTTCGGT TTTTTGAAGG TTACGTATTT CAACGGCAAA TTTTTCTAGT 840

GATGTCGCGA TTAATGCTAA TGTCGCAATA TAGTATGCAT GTCGATCGCG TTGCAATGTT 900

TGCGTTGATA CAGGCGCTGT GCCAATACCT AAATGTTTAC ACACATAACT TTCTATTTCA 960

GGAGGAATGT TAGCAAAAGT ACCTACTGCA CCACTCATTT TCCCTACTTC AATTTCTTCT 1020

CTTACTTGTT TGAAACGTTG TAAGTTACGT TGCATTTCCG TGTACCACAA TGCCATTTTG 1080

ACACCAAATG TAGTTGGTTd TGCATGCACT CCATGTGTAC GTCCCATCAT CAATGTATAT 1140

TTATAATTTT TTGCTTTTTC AGCTAAAACG TCGATAAATC TTTCTAAATC TTTTTCAATA 1200

ATGTCATTTG CTTGTTTAAT AACGAAACTT AAAGCTGTAT CTACAACATC AGTAGAAGTT 1260

AAACCATAAT GTACcACTTA CGTTCTTCAC CTAGCGTTTC AGAAACTTGT CTAGTAAAGG 1320

CTACAACATC ATGGCGCGTT TCTTGTTCAA TTTCTTGTGC ACGTTCGACA TTTACCTTTG 1380

CGTTTTGACG AA'rn-rrixsT ACGTCAGCTT TCGGTATATG TCCTAATTCA CTCCATGCTT 1440



(2) INFORMATION FOR SEQ ID NO: 537:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 784 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 537:
GATAAATCTA TnCAGTTTCC GTCCAAATAT CtGCaCCTAA AGCTTTTAaG TGTTCTACAA 60 
TATCTgTATA ACCTCTATAA ATATGTTTAA CATTGTAAAT TGTAGTTACA CCCTCAGCAA 120 
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CATGTAATGT TGATGGTTTT ATCGTTGCTG TGCCTTCGTC AACTTCAATA TTTGCACCCA 240 

TGCGCTTTAA TTCTTCAACA TGTTTAAAAC GCTCCGGATA AATCGTATCA GTTACAAATG 300

AAGGACCATT TGCCATAAAT AATAATGGTG TAATAGGCTG TTGCAAATCA GTAGCAAAAC 360 

CTGGATATAC TAGTGTTTTA ATATCAACAA ATTGATATGG CGCATTATTA TTGATGCGAA 420 

TTCTTTCGTC TCTTACATCA ACATTCACAC CTAATTCACT AAATTTAGCA GTTAATGTTT 480 

CTACATGTTT CGGAACAATA TTATTTAATA TAACATTTTC TCCACATGCT GCAGCGATAC 540 

ACATATATGT GCCTGCTTCA ATTCTATCAG GTATAACTTG ATACTCAGAA CCATGTAATT 600 

CTTTGACGCC ATTGATTTTA ATTGTTGATG TACCCGCTCC CTTAATATTA GCTCCCATAC 660

TTGTTAAGAA GTTAGCAACA TCAACTACTT CCGGTTCTTT AGCAGCATTT TCAATTACAG 720 

TTTGTCCTGT TGCATAAACT GCAGCTAGCA TAATGtnAAT TGTTGCACCT ACGCnAACCA 780 

TATC 784

(2) INFORMATION FOR SEQ ID NO: 538:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3733 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 538:

CAATCTCCAC CAATGACACG TCGTCATTCG TGACCTCATA CCArACAAAA AACAGTCTCG 60 

CAATCAAGAC TGTTTTCCAC TCAATATATT CATCCATTAG CGTAATAGAT TATTTGACTT 120

CTGTAGCTAC AAAGaTTTTA CGTTTTTCCC AAACGCCTGT cTTTTCATTG TAATCATCAC 180 

AAGTAATTAA TGTTAATTGT TTATCTTTAC CTTTTTGTTC ATCTAGAAcT CCTACATCTG 240 

TAGGCTTAAC ATCTCTTATA CTTGTCATTT TATACTTACG TGTTTCATTA CCAACTTTAA 300

AGTACACCAT ACTACCTTTT TTGGCTGCTT TAAGATTTGT AAATTGATAG TTCGGACGGT 360 

CAATGAAAGT GTGTCCTGCA ATTGAAATAT TTTGATCATC TAGTGATTCA TTTTCTTCTG 420 

CAAAGCTTAC ACCTCTATTT AATTGTTCAG GTGTTGCTGG TCCTGGATAT ACTGGTTCTT 480

TAATATCAGC ATCTGGAATT TCAATATAGC CTGCCACTTT CGATTTATCT TTCGGAATTT 540 

GAGGTTTAGC TTGCTGCTTT TTATCTTTAC TCGCCTGTTC TTTTACATTT TTATCATATT 600 


GTTCAATCTT TTCATCTTTA TCTTTATCGT GAAGATAATT ATCGATATGT GGTTTAGCAA 660

ACAAATATGC TGCCACTAGG aTAAGTACCA CACCAGCGAT TGTCATTAAT CGATTTGTCC 720 
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AGTATACCAT TAATTTCAAA ATGACTCATA OCAATTCATT TTATACTATA AAATTTACAT 640 
GTATACTTTT ACGTTAGATT TCATTACACA TATTTGCATT CAAATAACGA AACGCTTTTA 900 
ATAATTACTA AGGGGGAATT GATATGATTA GATACGCTAA AAAAGAGGAT TTAAACGCTA 960 

TATTAGCGAT ATACAATGAT GCCATTATCA ATACTACAGC TGTTTATACT TATGAACCAC 1020 

AAACCATAGA CGAACGTGTC GCATGGTTTG AAACGAAACA ACGTAAGCAT GAGCCTATCT 1080 

TTGTATTTGA GGAAAATGGA AGTGTCTTAG GGTTTGCGAC GTTCGGTTCA TTTAGACCTT 1140 

GGCCAGCATA CCTATATACA ATCGAACATT CTATTTATGT CGATGCTTCA GCTAGAGGAA 1200 

AAGGTATTGC TAGTCAATTA CTACACCATT TAATTGTGGA AGCAAAAGCT AAAGGTTATC 1260 

GTGCGCTAGT TGCAGGCATT GATGCTTCCA ACAAAGCGAG TATTCAGTTG CATCAAAAGT 1320 

TTGCTTTTAA GCATGCCGGC ACACTGACCA ATGTAGGTTT TAAATTTAAT AGATGGTTAG 1380

ATTTAGCATT TTACGAATTA GATTTACAAG ACTAGTAATG TTTGAATCAG ATAATATAAA 1440

CAAGACAACC ATGTTAATTC CCTTAACATA ACAAGCCAAC ATATAAAATT TTAAACTTCT 1500 

CAGGGGAGTG GGACAGAAAT GATAAAGAGC CACTAATGAT TTATTATGTA GTGGTTCTTA 1560 

CACATTAGCC ACAGCTAATG TGTAcTTAAA AATAGGAATA CATGAGTAAA ACTCATGCAT 1620

AAGAAATACT AATTTCTATA GAAAAAGTAT TTCTTTATCG TCGTCCCACC CCAACTCGCA 1680 

CATTATTGTA AGCTGACTTT TCGTCAgCTT cTGTGTTGGG GCCCAAAAAG CTTGTTACAA 1740

GCGCATTTTC GTTCAGTCAA CTACTGCCAA TATAACTTTG TAGAGCATAT TACATTGATT 1800

TACATTGTCC CTTTTATTTA TTCTTTTCAA ATACTATCCC CATAGCTTTG ATTTAACGCT 1860 

TTTTCTCAAT AACAAAACGA ATATAGTAGA ACATGAAAAC GATAATCATG CTGAGCGATA 1920 

AAGATTTAAA TAATAGATTG ACCCACGTTC CCTCAGTCGT ATATCCATAT GTAATCGTTG 1980 

TGTTAATGAT GAATGCTATA AAGATGATTG ATAGTCTTAG CATATCATCA CTCCTTTTAA 2040 

GTTATTTTAG ATATACGGGG GCGCTTTTGC AATCACTATT TTGATTAGTA TGCATTTTCC 2100 


ATAAATCTTT CAACTTCTTC AGAGATAATT AAGAAGCATC TATCTGGTAC TAATGATCCA 2160 

GACAGATGCT TCTTTTTTAT CAATATTTTA TTGTTATCTC ATTAATTATT TTTAACCATA 2220 

TCTTCAGCTG TGCCAAAGAT TTTACGTTTA ATTGCTTCGC CAGTTGGTGT GCCTGCTAGT 2280

CCACCCAATC CAGTTTCACG TAATGATGCA GGAAGGTTAC GACCAACCTT ATCCATTGCT 2340

TCAATAACTT CATCAACAGG GATTCTACTT TCAATACCTG CTAATGCTAA ATCTGCTGAA 2400

ATTAAAGCGT TACCCGAACC AATTGCATTT CTCATAACAC AAGGAATTTC AACAAGTCCG 2460

GCTACTGGAT CACAAACTAA ACCTAATAAA TTACTTATCG CTAATGCCAT AGCGTGCCCG 2520 
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GAACCAACTT CAGnTTGGCA GCCACCTGTT GCACCAGCTA CACTTGCATT GTTTGCTAOG 2640 

ACACGCCCAA ACAATGCTGA AGTGAATAAG AAATCAATCA TTTGCTCTTC TGTTAAATCA 2700 

TGTGTTTTTT CTAATTTAAA AAGTGCACCG GGAATGGTAC CCGAGGAACC AGCTGTTGGC 2760 

GTtGCACAAA TAATACCCAT CGCAOCATTG ACTTCATTTG TTGCAATGGC AcCtTTGcTG 2820 

CGTCAATCAT TTCATATCCA GACAAAGCAT GATGTGTTTC ATTATAATCA CGTAGTTTAG 2880 

CAGCATCATG ACCAGTGTAG CCCGTTACAC TTTCAACCCC ATCACCTGTC GTCCCTTTGA 2940 

TTACTGCGTC TCGCATGACA TCTAAATTTT GTTTCATTTO CGCTCGCACT TCATCACGTG 3000 

ATTTACCGCT TAATTCCATT TCTTCTTTAA CCATGATATC CGCAAATGAC ATATTATTTT 3060 

CTACGGCATA ATCTATAGTC TCTCTAATTG AATCAAACAT GTTTATTCCC CCTCTAATTT 3120

ATATAGGAAA CGTTTACGTC ACTGTATTTC TCTTTAATTG TATTTAATGT TGATTCTGAG 3180 

ATTGCTTTAT TTAATGGTAT TACAACCAAG CATTTATCTT CATCTATCTT AATAAATTCA 3240

TCTTTACAGT CTAATTTCAT ATCGTTGATA TCATTGATGA AATGATTTAC TTGTGCTTTA 3300 

GTCATATTTC CGTCAACAAC TAAAATTGGT AATCCATGAT TTAAATCTAC TTCTAGTCCA 3360 

TTTATATGAA TACCTTTAAT TTTAATTGTA CCACQACCGA TTGAAATACC GATAATTTCA 3420

ATGTAGCGAC CATCATTACG AGATGATTTG ATATAAGCAC AGTTTGGATG TTGACCAATA 3480

CTATCGCCTT CTTCTTCGAT GATATCTATT TTAATACCAT CATCAGCTGC AATTTCTAAT 3540

GAAGATTTAA TTCGGTTATC AAATGTTGAA TATCCCATTG CTCCACCCAC AATAGCGACA 3600

TCTGTACCAT GTCCTTGGTG TGTTTGAGCA AATGATTCAT AATAATGTAT TTCAATATTT 3660

TTATATCTCC CAATATTGCG CGTGCTGAAT TCCCCTTTAC TGCACCAGCC GTATGAGAAC 3720

TTGAAGGGCC CAT 3733 
(2) INFORMATION FOR SEQ ID NO: 539: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 525 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 539: 
TGGCTGTCTT CTCTATGAGT GTAGTAAGTA AGTTAACGGA TTTAACGCCA AGGCAAATAC 60
GTTACTATGA AACACATGAA CTCATCAAAC CTGAAAGAAC AGAAGGTCAA AAACGTCTGT 120

TCTCACTCAA TGATTTGGAA AGATTACTAG AAATTAAATC ATTATTAGAA AAAGGATTTA 180 
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AAGAGATAAG AAAAAAGATG ATTGTAGATG CCACGCAAAA GCCTATTGGA GArACTTTGC 300 

CAATAAATCG TOGTGATTTA TCCCGATTTA TTAAATAAAA TTTGGAGGAT TTTAAAATGC 360 

CAAAACGTAC TTTCACTAAA GACGACATTC GTAAATTTGC AGAaGAGGAA AaTGTAaGaT 420 

ATTTAAGATT ACAATTCACT GATATTTTAG GAACAATTAA AAATGTTGAA GTGCCTGTAA 480

GCCAATTAGA AAAAGTACTT GATAACGAAA TGATGTTTGA CGGTA 525 




(2) INFORMATION FOR SEQ ID NO: 540: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1408 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 540:
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TTGATTTGCT 


ACAAAGTATC TnCTCATTTC TGTATCCTGA AAAATCTTTA GTGTAATAAT 60

GTTGTTCAGT TTTAATATTT TCAGTCATAG TTGACTACCT CCGTATATTT TGATTTAATT 120

AAGTTGTATA TTTTGATGAA CACTTATTGT TACTTGTTGG CGCAAGTAGC AGTTTTTTCA 180

TTCTTCATAA AAGTATTCCT TATAGAATAT GAATGTTGCG ATACTTGCGA ATCCTGCAAT 240

TGaCCATGCT GtAGTGAAGT ATAGAAACGG CATAAGTACA ATCGCTAAGA CTGTGAAGCA 300

TAGTACTGCT ACTAGGTAGC TTTTATAAAT GTTACTCATT TTCTTTTTTC AACTCCTCCA 360

TTATTCTCTG GTCTGATAAG TCGTGATAAG GGAATTTTTT CCtAGCTAAT TGGACTGGTA 420

TTCTGCGTCG TATCGCAATG TATCCTTCAT CTTCAAGCTC TTTATTCAGT TCTCTTATTA 480

TTTGTCCTGC TTTGGATTTA GAAACAGATA AAATTACCgC AAGTTCTTTA GCTTGCAAAC 540

TATTTTTCAT CATATCTTTT CCTCCTTTAA AATAACTGTT GATTCTCTGG GTTATCTGCT 600

TCGTAATTAT CTGCAATAAT ACTTTTAGCG AAAAAGTCCA AACTGACCTT ATATAGGTTG 660

TTCATAGATT TCTTTACGTT AACCCCTTCC TCAAGTACAT AAGGCACCCT AAAATCATTT 720

ATAAACAGTC CGTTTTCGTC TAAAGTAACG GTTGGTAATT CAGGTTTGTT CCGTCTATAA 780

ACTTCTCCTA GTGTAGGTTT TTGCTTTTCA GCTTGTTTAG TGAAGTCGGA AAATGCCTTA 840

AGTAGTTTTA TTCCTGAATC AGGATCACTG TGTCGCTCAA TCGTTTCTGC TGTAGACTCT 900

TTACTAAAAT CATTTCTATT GATTACAGGC TTTCTCGTAT TTCGTTCAAT CTTCCAAACC 960

TTCCACGTCA CAACTGCCAT TGTGGTGAGG AGGGTTGTTT TGTATAGTGC GTTCATTTGT 1020

AATTCCTCCT ATTAAGTTGT TTGTTCAATT GTGTGTGTTA TTCTTCTTCG TCTAAATCAA 1080
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CGACTTOGTA AGTGTGCTCA ATCTCGCCTC cATATGTCAC AGTAAGAGTA TCTTTGTGTG 1200 

TGTATGTTTG ACTTTTGTTy TCTtTAACTG CATAAAGTGT TAATACTATA TTGTTTAGCT 1260 

TTyCTTTTTG TTCTGGTGTC ATTTACGCTC CCCCTAmATT AGCyTCATAA CCGAATTCAG 1320 

TCATGATTTC ATGTATTTTC AATCTGCCTT TTTGTGTCCA TCTAGTTTGT AAAACTGTGT 1380 

CTTCTCTGCC ATCAGAACGC ACAATTGT 1408 

(2) INFORMATION FOR SEQ ID NO: 541: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 541:

GTTCGTCAAT GATTTTTATC CGATGCTATG AGCATTaTCA AATACAAAAT GCTCTCTTAA 60 

AAGCAGTTAT TGACTGAAAA TCTACTTCTA AGAGAGCACT TTATTTAATT ACTTAAGAAA 120

TCTTGAAATT TCAATATACG ATGTTTATGA TAAGTCGCTT ATTTCATCTT TAGGCTTGTT 180

ATTAGTAAGT AGTTTAATAC CACTGATTAA CCATAAAGCA AATGTAATTA TGTTACCACT 240

TATTACAGCT CCAATAATCA ACAATATACC ACTCATTTTT TTGTTTTTAG ATGCTTTAAA 300 


CATACCGATT GCACCTAAAA TAATTGAAAT GATTCCAAAT ATGAATAGGG ATAAGAATAA 360 

TACAGTGAAA ATTGCTGCTG CTGTTTCTGc ATCAACTGGG nCAACCTCAC CATTAACTGT 420 

TGTTGGACAC AT 432 


(2) INFORMATION FOR SEQ ID NO: 542: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2426 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542:

ATAATCATGA AGTnGCTAAA nCGCCAAATA ATGATGGTTC TGGACATGTT GTGTTAAATA 60 

AATTCCTTTC AAATGAAGAG AATCAAAGCC ATAGTAATCG ACTCACTGAT AAATTACArG 120 

GAAGCGATAA AATTAATCAT GCTATGATTG AAAAATTAGC TAAAAGTAAT GCCTCAACGC 180

AACATTACAC ATATCATAAA CTGAATACGT TACAATCTTT AGATCAACGT ATTGCAAATA 240 
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GTATAAAAAG TCAACGAAAT ATTATTTTOG AAOAACTTGC ACGTACTQAT GATAAAAAGT 360 

ATGCTACACA AAGCATTTTA GAAAGTATAT TTAATAAAGA CGAGGCAGTT AAAATTCTAA 420 

AAGATATACG TGTTGATGGT AAAACAGATC AACAAATTGC AGATCAAATT ACTCGTCATA 480

TTGATCAATT ATCTCTGACA ACGAGTGATG ATTTATTAAC GTCATTGATT GATCAATCAC 540 

AAGATAAGTC GCTATTGATT TCTCAAATTT TACAAACGAA ATTAGGAAAA GCTGAAGCAG 600 


ATAAATTGGC TAAAGATTGG ACGAATAAAG GATTATCAAA TCGCCAAATC GTTGACCAAT 660 

TGAAGAAACA TTTTGCATCA ACTGGCGACA CGTCTTCAGA TGATATATTA AAAGCAATTT 720 

TGAATAATGC CAAAGATAAA AAACAAGCAA TTGAAACGAT TTTAGCAACA CGTATAGAAA 780

GACAAAAGGC AAAATTACTG GCAGATTTAA TTACTAAAAT AGrAACAGAT CAAAATAAAA 840

TTTTTAATTT AGTTAAATCG GCATTGAATG GTAAAGCGGA TGATTTATTG AATTTACAAA 900 

AGAGACTCAA TCAAACGAAA AAAGATATAG ATTATATTTT ATCACCAATA GTAAATCGTC 960

CAAGTTTACT AGATCGATTG AATAAAAATG GGAAAACGAC AGATTTAAAT AAGTTAGCAA 1020 

ATTTAATGAA TCAAGGATCA GATTTATTAG ACAGTATTCC AGATATACCC ACACCAAAGC 1080 

CAGAAAaCGt TAACACTTGG TAAAGGTAAT GGATTGTTAA GTGGATTATT AAATGCTGAT 1140

GGTAATGTAT CTTTGCCTAA AGCGGGGGAA ACGATAAAAG AACATTGGTT GCCGATATCT 1200 

GTAATTGTTG GTGCAATGGG TGTACTAATG ATTTGGTTAT CACGACGCAA TAAGTTGAAA 1260 


AATAAAGCAT AATTATATTG GGGGAAGAGC ATCTATATAT TTTTTTAAGT ATATAAGACG 1320 

TCTTATTTCC CCTTAATTTA TTGTGAAGTA TATGCAAAAT GCAATGAATA GATTGTCCAT 1380

CATTTTAACG TTATAATGAA TTTAACGACT TAGAACTACA CAAGTAAAGG AGAATGAAGA 1440

TGTCTCGAAA AACGGCGCTA TTAGTTTTGG ATATGCAAGA AGGTATAGCG AGTAGTGTAC 1500 

CTAGAATAAA AAATATTATT AAAGCGAATC AGAGAGCAAT TGAAGCAGCA AGACAACATC 1560 

GAATACCAGT CATTTTCATA CGTTTAGTGT TAGATAAGCA TTTTAATGAT GTCTCCTCGA 1620

GTAATAAAGT GTTTTCAACA ATTAAAGCTC AAGGATATGC GATTACTGAA GCAGATGCAT 1680 

CTACACGAAT ACTTGAAGAT TTAGCACCAC TAGAAGATGA GCCGATTATT TCTAAGCGAC 1740 

GCTTTAGCGC ATTTACAGGT AGTTACTTGG AAGTTTATTT ACGTGCAAAT GATATTAATC 1800

ATTTAGTATT AACGGGTGTC TCTACAAGTG GAGCTGTATT GAGCACGGCA TTAGAAAGTG 1860 

TAGATAAAGA CTATTATATT ACTGTTTTAG AAGATGCTGT TGGTGATAGA TCAGATGATA 1920 


AACATGACTT TATTATTGAA CAAATTTTAT CACGCTCATG TGACATTGAA TCCGTAGAGT 1980 

CATGGAAAAG TAGTTTATAG TTAATATAAC GTCAATTAAA GCTCGGCAGT AATGTTTGAG 2040 
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GAGGAACATT TGAACATAAA ATAATATATT TATATAAAAC GACCgAGGCG TTCGAACTGA 2160 

ATGtCCTCGG GTTTAATTGA ATAGAAATCG GACTTATGAA CGAAATATGT TTAAGTCGAA 2220 

CTCCTTGTTT ATACTTATAA ATTTTACGGG TTTAATATAA TACTTATTTA CCTGTAATAT 2280

ATGCATAATT nCTTCAGTCG GTCAGCCTGT CGTTOCATAG TTCCTATGCA GCAAATGCAT 2340 

ATCCTAATCC TTTAACATTG GCATTnCTGC AAATGAACGC ATAGAATCCA TTCACTGTTA 2400 


ACTTTTTnCA ACAAATGTCT nACATG 2426 
(2) INFORMATION FOR SEQ ID NO: 543: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1874 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 543:





GAGTTGGGGA ATGTGCTCAA AATATGCGGA CTTTATGCAT TyCGGAATTG sCCaATTGCA 60

GCTTTAAGCT ATGGTCAAAA AAAGAGGGTC ACTATAGCAT CTGTTTTAGT CTTAAATCCG 120

GAAATAATCA TATTGGATGA ACCGACTGCT GGTCAAGATT TCTATCATTA TAATGAGATA 180

ATGTCATTTT TAATTGAACT AAACAGACAG GGGAAGACGA TTATTATGAT TACGCATGAT 240

ATGCATTTAT TGTCTGAGTA TAGTTCAAGA ACAGTTGTAT TATCAAAAGG TCAAGTCGTT 300

GCTGATACCA CGCCAgTATT GGTTTTAAAT GATAAAAAAA TCTGTGAGAT TGCATCATTG 360

AGACAAACAT CGCTATTTGA AATGGCCGAA TATATAGGGA TTAGCGAGCC ACAGAAATTA 420

GTACAATTAT TTATTAACCA TGATAGGAAG GTGAGACGcC AATGAATCAA TATAATACTA 480

TAGGTTTTCA CCCGGGAAAT AGTCGTATTC ATCAATTAAA TGCGACTGTT AAACTTTTAT 540

TCTTATTAGT TGTTTCTATT TCTGCAATGG TGACTTATGA CACAAGATAT TTAATTTTAA 600

TTAGTGCTTC ATCTATTTTA TTGGTCAAAT ATGCTCATAT TGAATGGAAA CAAGTTCGCT 660

TTGTTGTTAA ATTCATTCTG TTTTTCACAA TAtTAAATAT TATTGCCGTG TACATATTTG 720

ACCCTGAATA TGGTGTGAAG ATTTATAATC AGCGTACAGA GTTAGTCAAT GGTATTGGTC 780

GATTTACGCT AACATCACAG GAATTATTCT ATCTTTTTAA TCTAATATTA AAATATATTA 840

GTACAGTTCC TTTAGCGTTA ATATTTTTAT TCACAACGAA TCCGAGTCAT TTTGCTGCAA 900

GTTTAAATCA GCTAGGTGTG AATTATAAAA TCAGTTACGC AGTCTCACTA GCATTAAGGT 960

ATATTCCAGA TATTCAAGAA ACATATTttA ATATTTCACA AGCGCAACAA GCAAGAGGAT 1020
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TACCTTTAAT ATTTTCTAGT ATCGAAAGAA TTGACACTAT TAGTACTGCT ATGGAGTTAA 1140 

GACAATTCGG ACAGTATAAA AGGAGAACCT GGTACGTCAA AAAACAATTA AAAAAAGATG 1200 

ATTATGTTGT TTTGTGTTTG ACGTTAATAC TTCTGATGTT AGTAGTTACA TTATTCTTTT 1260 

TAAATAATAG TCGATATTTC AACCCGTGGC ATTAGTATTC ATATAAATAG TCTTTAAATA 1320 

GAAATAGGAG GGAGACATTT AATGATAAAT ACTGAAAGAT TAAATTTAAT GATTCCAAGT 1380 

TCCTCGCATT TAATTGAACT TTATAATATT TGTAGTCATC CACAAGCAAA TATATACACT 1440 

CCCAAAGGTT TACATAATTC CAAATTAGAC ACACAACGGT GGATTGAAAA ATGGCGAAAC 1500 

CATTGGCAAC AATATCAATT TGGTTACTTT GTATTGGTAA AAAAAATAGA TTGTAGTGTT 1560 

ATTGGTATTT GTGGATATGA ATATCGACAA TTAAAGCAAG AaACAGTATT AAATTTATTT 1620 

TATAAATTAC ATCCAAGTTT TGAAGGACAA GGGTACGCAT GTGAGGCTAT TACAGCAATC 1680

ACAAATTTTG TGAATTATAT CGATCAAGAA ACAGTAAAAG TTATCAGGAC AAATAAGTGT 1740

AACCAACGTT CAATAAATTT AGCAOAAAGG CTTAAATTCA AGCGAGACGA TACTATGGAC 1800 

GACATTATCA ATCAAGGAGA TATTGTGTTT TAAnAATAAA ATACTATGAC ATTATCTAAA 1860 

AAATAAAATT AAAA 1874

(2) INFORMATION FOR SEQ ID NO: 544:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5280 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 544:

TCAACATTTC TAACACCAAT GTGAAAATGA TCTATGTGAT TTGCAATGGC TTGATTTGTA 60

ATATGTGTGC CTAAATGACC TGTAGCACCT GTTAACATAA TATTCATTCA CTTCATCTCC 120 

TAATCTTTAT ATACATAACA TAATACTTAT TTGATGGTTT TCAAAACATT TGATTTTATA 180 

AAAAATTCTA ATCTGTATTT ATTGTCGACG TGTATAGTAA ATACGTAAAT ATTATTAATG 240

TTGAAAATGC CGTAATGACG CGTTTTAGTT GATGTGTATC ACTAATATCA TTGAAAATTT 300 

TAATCaGGTA CTACGACAAT ATGATGTCTG TTTTGTGTCT GAAAGTTTTA CAGTTTTTAA 360 

AATAAAAATG GTATAAAGTG TGATTTGTAT AAAAAAGAGT CTCGACGGAT AAGAATTGAT 420

TAATAACAGT TAGCATTTTA TTAATTACCT TAACAATGAT TCAAGTTTAG TTAAATGAGG 480

TTTAATTTGA AAGGGGATAG CGCCTCAATA TAATGTAGGT AGATTGTTCA TATTACGTAA 540
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AAGCAAGTTT AGCATTGGGA ATGTTAGCAA CAGGTGTAAT TACaTCGAAT GTACAATCAG 660 

TACAAGCGAA aGCAGrAGTT AAaCAACAAa GTGAaTCAGA GTTAAAACAC TATTATAATA 720 

AmCCAATTTT AGAGCGTAAA AATGTGACTG GATTTAAATA TACTGATGAG GGTAAACACT 780 

ATTTAGAAGT CACAGTAGGG CAACAGCATT CTCGAATCAC TTTACTTGGA TCTGATAAAG 840 

ATAAATTTAA AGACGGAGAA AACTCAAATA TAGATGTGTT TATCCTTAGA GAAGGTGACA 900 

GTAGACAAGC AACAAATTAC TCAATTGGTG GCGTTACAAA ATCAAATAGT GTGCAGTATA 960 

TTGATTATAT CAATACGCCA ATTTTAGAAA TCAAGAAAGA TAATGAAGAT GTACTTAAAG 1020 

ATTTTTACTA CATTTCAAAA GAAGACATCT CATTAAAAGA ACTTGATTAT AGATTAAGAG 1080 

AACGTGCGAT TAAACAACAC GGCTTGTATT CAAATGGTCT TAAACAAGGT CAAATTACAA 1140 

TTACAATGAA TGATGGCACA ACACATACAA TCGATTTAAG TCAAAAACTT GAAAAAGAAC 1200 

GTATGGGTGA GTCAATCGAC GGCACTAAGA TTAATAAAAT TCTAGTAGAA ATGAAATAAT 1260

ACTTTCTAAC AACAAAGCGC TATGTTGAAT AGTGCTTGTT ATGGAAATAT ATGGAAGTTA 1320 

AGCGACGTAC TGTTGCTTAG CTTCTTTTTT TGAGGGGAAA AGTTACAAAA CTCACACAAA 1380

CAGTCGCACC ACGCATTATC TTTTGCTTAA ATAGCTTAAT CATATTTTAT GAATAGTTAA 1440 

AAACAGGTTA ATGTGAATAT CCGAATACAG CTCCTATAAT ATGGGTGTAT GATTCAAATT 1500 

ACGTAATAAA ACAATCTAAT TATAATAGAT TGGAGCATAC AACTATGAAA ATGAAAAATA 1560 

TTGCAAAAAT AAGTTTGTTA TTAGGAATAT TAGCAACAGG TGTAAACACT ACAACGGAAA 1620

AACCAGTTCA TGCCGAAAAG AAACCTATTG TAATAAGTGa AAATAGCAAA AAATTAAAAG 1680 

CTTATTATAA TCAACCTAGT ATTGAATATA AAAATGTGAC AGGTTATATC AGTTTCATTC 1740

AACCAAGTAT TAAATTTATG AATATCATAG ATGGTAATTC TGTTAATAAT ATTGCTTTAA 1800 

TTGGCAAAGA TAAGCAACAT TATCATACGG GTGTACATCG TAATCTTAAT ATATTTTACG 1860 

TTAATGAGGA TAAGAGATTT GAAGGTGCAA AGTACTCTAT TGGGGGTATC ACGAGTGCAA 1920

ACGrTaAAGC TGTCGACCTA ATAGCAGAAG CAAGAGTTAT TAAAGAAGAT CATACTGGTG 1980

AATATGATTA TGACTTTTTC CCATTTAAAA TAGATAAAGA AGCGATGTCA TTGAAAGAGA 2040 

TTGATTTTAA ATTAAGAAAA TACCTTATTG ATAATTATGG TCTTTACGGT GAAATGAGTA 2100 

CAGGAAAAAT TACAGTCAAA AAGAAATACT ATGGAAAGTA TACATTTGAA TTGGATAAAA 2160 

AGTTACAAGA AGACCGTATG TCCGATGTTA TCAATGTCAC AGATATTGAT AGAATTGAAA 2220 

TCAAAGTTAT AAAAGCATAA CACATATACT TGATGACGAA ATAAGTTGAA ATTGAAATAG 2280 

AGAGGTTAAG TGACGATCAA ACGTTGCTTA ACTTCTTTTT AATGCTTAAA AATTATTTCA 2340 
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TTAATAATAC TTCAATAATT GTTAAAAGGG GTTTAATGTG ATTATCTTAG AACGCCATCT 2460 

ATAATGATGT TGTATGATTC AAATTACGTA AAAAGACAAT CGAATATAAT ATAGATTGGA 2520 

GCATACAATT ATGAAAATGA GAACAATTGC TAAAACCAGT TTAGCACTAG GGCTTTTAAC 2580 

AACAGGCGCA ATTACAGTAA CGACGCAATC GGTCAAAGCA GAAAAAATAC AATCAACTAA 2640 

AGTTGACAAA GTACCAACGC TTAAAGCAGA GCGaTTAGCA ATGATAAACA TAACAGCAGG 2700 

TGCAAATTCA GCGACAACAC AAGCAGCTAA CACAAGACAA GAACGCACGC CTAAACTCGA 2760 

AAAGGCACCA AATACTAATG AGGAAAAAAC CTCAGCTTCC AAAATAGAAA AAATATCACA 2820 

ACCTAAACAA GAAGAGCAGA AAACGCTTAA TATATCAGCA ACGCCAGCGC CTAAACAAGA 2880

ACAATCACAA ACGACAACCG AATCCACAAC GCCGAAAACT AAAGTGACAA CACCTCCATC 2940 

AACAAACACG CCACAACCAA TGCAATCTAC TAAATCAGAC ACACCACAAT CTCCAACCAT 3000 

AAAACAAGCA CAAACAGATA TGACTCCTAA ATATGAAGAT TTAAGAGCGT ATTATACAAA 3060

ACCGAGTTTT GAATTTGAAA AGCAGTTTGG ATTTATGCTC AAACCATGGA CGACGGTTAG 3120 

GTTTATGAAT GTTATTCCAA ATAGGTTCAT CTATAAAATA GCTTTAGTTG GAAAAGATGA 3180 


GAAAAAATAT AAAGATGGAC CTTACGATAA TATCX3ATGTA TTTATCGTTT TAGAAGACAA 3240

TAAATATCAA TTGAAAAAAT ATTCTGTCGG TGGCATCACG AAGACTAATA GTAAAAAAGT 3300

TAATCACAAA GTAGAATTAA GCATTACTAA AAAAGATAAT CAAGGTATGA TTTCACGCGA 3360 


TGTTTCAGAA TACATGATTA CTAAGGAAGA GATTTCCTTG AAAGAGCTTG ATTTTAAATT 3420 

GAGAAAACAA CTTATTGAAA AACATAATCT TTACGGTAAC ATGGGTTCAG GAACAATCGT 3480

TATTAAAATG AAAAACGGTG GGAAATATAC GTTTGAATTA CACAAAAAAC TGCAAGAGCA 3540

TCGTATGGCA GACGTCATAG ATGGCACTAA TATTGATAAC ATTGAAGTGA ATATAAAATA 3600 

ATCATGACAT TCTCTAAATA GAAGCTGTCA TCGGAAAAAC AAGAAGTTAA GTGACAACGG 3660 

TTTACATGTT GCTTAGCTTC TTTTATTATG CGTAATGATG TAAAAAGACG AATATTCATT 3720 

TGTTTGTAAA AGTGGCATTT CTATGTCTTA AAAGTGACGA AACTTCAAAT GTGCCAAGTG 3780 

TTGAATCACA TCAAAATCAT TTTTATTTAA CGAACATTAT GGATTTCTTA ATTTACTTAA 3840 

CGATGATTCA AATATAGTTA AACAAGGTTT AATGTGAATG GAGCAATACG CCATCTATAA 3900 

TAAAGCTGTA TGATTCAATG AATGTAATCG AACAAATCTA ATAATTACGA ATGGAGCATA 3960

CAACTATGAA AATAACAACG ATTGCTAAAA CAAGTTTAGC ACTAGGCCTT TTAACAACAG 4020

GTGTAATCAC AACGACAACG CAAGCAGCAA ACGCGACAAC ACTATCTTCC ACTAAAGTGG 4080 

AAGCACCACA ATCAAGACCG CCCTCAACTA AAATAGAAGC ACCGCAATCA AAACCAAACG 4140 
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CGCCTTCAAC TAAAGTGACA ACACCTCCAT CAACAAACAC GCCACAACCA ATGCAATCTA 4260 

CTAAATCAGA CACACCACAA TCGCCAACCA CAAAACAAGT ACCAACAGAA ATAAATCCTA 4320 

AATTTAAAGA TTTAAGAGCG TATTATACGA AACCAAGTTT AGAATTTAAA AATGAGATTG 4380 

GTATTATTTT AAAAAAATGG ACGACAATAA GATTTATGAA TGTTGTCCCA GATTATTTCA 4440 

TATATAAAAT TGCTTTAGTT GGTAAAGATG ATAAAAAATA TGGTGAAGGA GTACATAGGA 4500 

ATGTCGATGT ATTTGTCGTT TTAGAAGAAA ATAATTACAA TCTGGAAAAA TATTCTGTCG 4560 

GTGGTATCAC AAAGAGTAAT AGTAAAAAAG TTGATCACAA AGCAGGAGTA AGAATTACTA 4620 

AGGAAGATAA TAAAGGTACA ATCTCTCATG ATGTTTCAGA ATTCAAGATT ACTAAAGAAC 4680 

AGATTTCCTT GAAAGAACTT GATTTTAAAT TGAGAAAACA ACTTATTOAA AAAAATAATC 4740 

TGTACGGTAA CGTTGGTTCA GGTAAAATTG TTATTAAAAT GAAAAACGGT GGAAAGTACA 4800 

CGTTTGAATT GCAGAAAAAA TTACAAGAAA ATCGCATGGC AGATGTCATA GATGGCACTA 4860

ATATTGATAA CATTGAAGTG AATATAAAAT AATCATGACA TTCTCTAAAT AGAAGCTGTC 4920

ATCGGAAAAA CAAGAAGTTA AGTGACAACG GCCTACATGT TGCTTAGCTT CTTTTGTTAT 4980

GTTCGATGAT TTGAGAACCC GAATTTTCGA TGGGTCCAAA TATGACGTGG AAGAGACCTG 5040 

AATTTATCTG TAAATCCCTA TCTATCGGGT GTGAAGCACA ACGGGATCAG TTTTATTTAA 5100 

CGAACATTAT AGATTCCTTA ATTTACTTAA TAATGATTCA ATGATTATTA AACATGGTTT 5160 

AATGTGAAAG GTCAAATACG CTAACTATAA TAAAGCTGTA TGATTCAATA GACGTAAGCG 5220 

AACAAATCTA ATAATTACGA ATGGAGCATA GAACTATGAA AATGACAGCA ATTGCGAAAG 5280 
(2) INFORMATION FOR SEQ ID NO: 545:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 886 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 545: 

AGTAAAATTG CCGGTATGAT GGACACAAAC GGTGACCTTG GTCAAGGTGA ATTAGCGATT 60 

AATCCACCtA AATCAGATTT gAACGAATTA CCTTGGGCTA CACGTAAAAA TAAACAGCCA 120 

GCTTCATCCG AAAAAgGTTC AAGTGGTCAT CATGGTAATG CAGCAATGCC TCAAACCAmA 180 

TTAGATTATC AAATATCTAT TGATAAGGTC GTTGAACAGG CGCAAAAAGC TGGTATTAAA 240 

AAGCCGTTTT CAATCGTATA TCCAAGTGAT AAAAATGGTA CCTTTATTGT ATCTAATACT 300 






EP0 786 519 A2 






GATCAATATA GCGGTAAAAA GCTAGGTACG ATTAAATATG ATGACTACGG TATTATTOCT 420 

AAATGGTTTA CATGGGGCAT TCCGCTTCAC GAAGGTCATT TATTCGGCAT TTTAAATAAA 480 

ATCATTAATT TATTTGTATG TATCGCTTTA TTAGTAGCCA TTGGCATGGG GTTTGTCTCT 540 

TGGATAAAGC GTACAAAAAA TACTGCAGTA AAAGTACCAC ATCGCGTAAA AAAACCAGCA 600 

TCTATATCAC TCATAATATG TTTAATTGTA TTAGGATTAT TAATGCCATT ATTTGGATTA 660 

TCACTTATCC TTGTATTTAT AATTGAATTA ATATTATATA TTAAAGATCG TCGTGCTAAA 720 

CAATAATGCA CTTAAAGTTT TGAACTGAOG AAATTTACAA AATGgATTCT CGTCTCTCTA 780 

ATTACtTAAA ACGGGgTtCy AaTAATAAAT CgTACTGaTG GgAAAGTTTT TACTTTTTAt 840

CTGtCCGAtT TTTTnGAAwT TGAAGATAAA AAAGCATCTA AAACGC 886
(2) INFORMATION FOR SEQ ID NO: 546:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4336 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 546:

GGCATTTGTG TCCTTATATA AGGAACTGTG tTAAATACAT TACTGTTGTT AAGTTGTTTT 60 

TGTAATTCAA AGAGCAGAAC AGAGTAACAT CATCAGTTGT AGTAAACGAT AATCCGGTAA 120

AACAACTAAA TGAAATAATG AAAGTCATTT AACCTGAACA TTAAAATATA TTTGTTTTTC 180

ATTAAGAATA ATTCAAGTAT ATTTAAATCG AGGTTAATTA TCGTATGAAA CGATGCACGT 240

TATAATAAAA ATGTATGATT CAAATTACGT AATGAAAACA ATCCAATATA TTAAGATTGG 300

AGCAAATAAA TATGAAATTT ACAGCATTAG CAAAAGCGAC ATTAGCTTTA GGAATTTTAA 360

CAACAGGAAC TTTAACAACA GAAGTTCATT CAGGTCATGC AAAACAAAAT CAAAAGTCAG 420

TAAATAAACA TGACAAGGAA GCATTATACC GATACTACAC TGGAAAGACT ATGGAAATGA 480 

AAAATATTAG TGCTTTGAAA CATGGTAAAA ACAACTTACG TTTTAAGTTT AGAGGTATTA 540 

AGATTCAAGT TTTACTGCCT GGAAATGATA AAAGTAAATT TCAACAGCGT AGTTATGAGG 600 

GGTTAGATGT TTTCTTTGTT CAAGAAAAAA GAGATAAGCA CGATATATTT TATACTGTTG 660 

GTGGTGTAAT ACAGAATAAT AAAACATCTG GAGTTGTCAG TGCACCAATA TTAAATATTT 720 

CAAAAGAAAA GGGTGAAGAT GCTTTTGTGA AAGGTTACCC TTATTACATT AAAAAAGAAA 780 

AAATAACACT AAAAGAACTG GATTATAAGT TGAGAAAGCA TCTAATTGAA AAATACGGAC 840 

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ATAACCTTGA TTTAAGATCT AAATTAAAAT TTAAATATAT GGGGGAAGTC ATAGAAAGCA 960 

AACAAATTAA AGATATTGAA GTTAACTTAA AGTAAATCAT TACGAATAAT TAAAAOTAAT 1020 

TGAAGCGGCT TAACGGTGAA ATGTAAATTG GTGCGCATAG CTTATACAAA AAGGATGCAT 1080

CAATCGATAT CGTCGTTAAG CCGTTTTGGT TTGTGTGTCA TGAATCCTAT CCCAATCTCC 1140

ATAAAGGTAA AATTTCCACC ACCAACATCA AAATTCTCCA CATCGCAACA TAACCAAATG 1200

TTATAATAAA TCTATTACAC AAAGAGATAA ATTACTTATT CAAAGGCGGA GGAATCACAT 1260

GTCTATTACT GAAAAACAAC GTCAGCAACA AGCTGAATTA CATAAAAAAT TATGGTCGAT 1320

TGCGAATGAT TTAAGAGGGA ATATGGATGC GAGTGAATTC CGTAATTACA TTTTAGGCTT 1380

GATTTTCTAT CGCTTCTTAT CTGAAAAAGC GGAACAAGAA TATGCAGATG CCTTGTCAGG 1440

TGAAGACATC ACGTATCAAG AAGCATGGGC AGACGAAGAA TACCGTGAAG ACTTAAAAGC 1500

AGAATTAATT GACCAAGTCG GTTACTTCAT TGAGCCAGAA GATTTATTCA GTGCGATGAT 1560

TOGTGAAATT GAAACGCAAG ATTTCGATAT CGAACACCTG GCGACGGCAA TTCGTAAAGT 1620

TGAAACATCA ACATTAGGTG AAGAAAGTGA AAATGACTTT ATCGGTCTGT TCAGCGATAT 1680

GGATTTGAGT TCAACGCGAC TAGGTAACAA TGTCAAAGAA CGTACTGCTT TAATCTCTAA 1740

AGTCATGGTT AATCTTGACG ACTTACCATT CGTTCACAGT GACATGGAAA TTGATATGTT 1800

AGGTGATGCA TATGAATTCC TAATTGGGCG CTTTGCGGCG ACAgCGGGTA AAAAAGCAGG 1860

CGAGTTCTAT ACACCACAAC AAGTATCTAA GATACTGGCG AAGATTGTCA CAGACX3GTAA 1920

AGATAAATTA CGTCACGTGT ATGACCCAAC ATGTGGTTCA GGTTCACTGT TGTTACGTGT 1980

TGGTAAAGAA ACACAAGTGT ATCGTTATTT CGGTGAAGAA CGTAACAATA CTACATACAA 2040

CTTAGCACGC ATGAATATGT TATTACATGA TGTGCGTTAT GAGAACTTCG ATATCCGTAA 2100

TGATGACACA TTGGAAAACC CAGCCTTTTT AGGCAATACA TTTGATGCGG TTATTGCGAA 2160

CCCACCGTAT AGTGCGAAAT GGACTGCAGA TTCAAAGTTT GAAAATGACG AACGATTCAG 2220

TGGTTACGGC AAACTTGCGC CTAAGTCTAA AGCAGACTTT GCCTTTATTC AACACATGGT 2280

ACATTACCTA GACGATGAAG GTACCATGGC CGTTGTACTC CCACATGGTG TATTATTCCG 2340

AGGTGCTGCA GAAGGTGTCA TTOGTCX5TTA TTTAATTGAA GAAAAGAACT ACTTAGAAGC 2400

TGTGATTGGT TTGCCAGCGA ATATTTTCTA TGGGACAAGT ATTCCAACAT GTATTTTAGT 2460

ATTTAAAAAA TGTCGCCAAC AAGACGACAA CGTACTATTT ATCGATGCAT CCAATGATTT 2520

TGAAAAAGGA AAAAATCAAA ATCATTTAAG CGATGCCCAA GTCGAACGTA TTATAGACAC 2580

ATATAAGCGT AAGGAAACAA TTGATAAATA TAGCTACAGC GCGACACTAC AAGAGATTGC 2640
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GATTGATTTA GATCAAGTCC AACAAGATTT GAAAAATATC GATAAAGAAA TCGCAGAAAT 2760 

TGAGCAAGAA ATCAATGCAT ACCTGAAAGA ACTTGGGGTG TTGAAAGATG AGTAATACAC 2820 

AAAAGAAAAA TGTGCCAGAA TTGAGGTTCC CAGGGTTTGA AGGCGAATGG GAAGAGAAGC 2880

AGTTAGGGGA TCTTACAGAT AGAGTAATTA GGAAAAATAA AAACTTAGAA TCGAAAAAGC 2940 

CTTTAACAAT ATCCGGACAG TTAGGTTTAA TTGATCAAAC AGAATATTTT AGTAAATCAG 3000 

TTTCGTCGAA AAATCTAGAA AATTATACAC TAATAAAGAA TGGAGAATTC GCGTATAACA 3060 

AAAGTTATTC TAATGGATAC CCATTAGGGG CTATTAAAAG ATTAACTAGA TATGATAGTG 3120

GTGTATTGTC CTCTTTGTAT ATTTGTTTTT CTATTAAAAG TGAAATGTCT AAAGACTTCA 3180 

TGOAAGCATA TTTTGATTCG ACACACTGGT ATAGAOAAGT TTCTGGAATT GCAGTTGAGG 3240 

GTGCAAGAAA TCACGGATTA TTAAATGTTT CTGTGAATGA TTTTTTTACT ATTCTAATTA 3300 

AATATCCAAG TTTAGAAGAA CAGCAAAAAA TAGGCAAGTT CTTCAGCAAA CTCGACCGAC 3360

AAATTGAATT AGAAGAACAA AAGCTTGAAT TACTTCAACA ACAGAAAAAA GGCTATATGC 3420 

AGAAAATTTT CTCACAGGAA CTGCGATTCA AAGATGAGAA TGGTGAAGAT TATCCAGATT 3480 

GGGAAAATAG CAAAATAGAA AAATATTTAA AAGAGAGAAA CGAACGTTCT GACAAAGGGC 3540 

AAATGCTTTC AGTAACTATA AATAGTGGCA TTATAAAATT TAGTGAATTG GATAGAAAAG 3600

ATAATTCAAG TAAAGATAAA AGTAATTATA AAGTAGTTAG GAAAAATGAT ATTGCATATA 3660 

ATTCTATGAG AATGTGGCAA GGGGCTAGTG GTAAATCAAA TTATAATGGG ATTGTTAGCC 3720 

CTGGATATAC TGTGCTTTAT CCAACACAAA ATACTAGCTC ATTATTTATT GGATATAAGT 3780 

TTAAAACACA TAGAATGATT CATAAATTTA AAATTAATTC ACAAGGATTA ACATCAGATA 3840

CATGGAACTT AAAATATAAA CAATTAmAAA ATATAAATAT AGATATACCT GTATTGGAGG 3900 

AACAAGAAAA GATAGGTGAT TTCTTTAAAA AAATGGATAT ATTGATAAGT AAACAGAAAA 3960 

TGAAAATTGA AATATTAGAA AAAGAGAAAC AATCCTTTTT ACAAAAAATG TTCTTATAAC 4020

TTTGATAAAT ACATAGATTG CATAAGAATA AAATTTGTAT AATTTAACAT AAAAGTTGTA 4080 

AAAGTAAAGT GAATTAAAAA CGAACATTAA ATTTAGGCAC TGTGAAAGCG CAGTGTCTTT 4140 

TTTGTGTCGA AATTGTGTAC AGAATAAGTA GTTAAATAAA GATTAAGTTG AGATAAAGTG 4200 

TTATTCGTAA ATAAAAGAGA GTAGATCGAT AGGAATTGAA TGATATTAGT TAACTATTTA 4260 

TTAAATTACT TAATAATGAT TAATTTTTAG TTAAAGTAAG TTTAATGTGA AGCACGACCA 4320 

TTGCTCATTA TAATGA 4336 
(2) INFORMATION FOR SEQ ID NO: 547: 

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(A) LENGTH: 487 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547:

TAAGCTATCT GAGATAATTG CTGATAACAT TAAACCGGCA ATTTCAGGTT TAATTTCAAA 60

GCCACGTTCT CTAAACATTT TGTATAAAAT TGTAGCTGTA CAACCAACTG GTTCAGCACG 120 

ATAACATAAA GGACCAGCAG TTTCGAAATT TGCAATTCTG TGATGATCAA TTACATGCTT 180 

AATTGTAGCA GAGGCAATCG TATCAGAACT TTGTTGGaAT TCGTTATGAT CAACTAAGAT 240 

AACATCTTGA CCATCTAAAT CATCTGTTAA TAATTCCGGA GCAGGTACAT TAAATGTATC 300 

TAACGCGAAT TGAGTTTCTG CACTCACATC ACCTAAACGG TATGCTTTGG CTCCTGAATT 360 

ACCTCGAAGT TGTTCaAATT CTGCCaTAAT AATCGCAGAT OAAATTGCAT CAGTGJcCTGG 420 

aTTCTTATGT CCGAAAATAT ATGTTTTAGC CAnTGTCAAA TATCTCCCTT GTAAATTGTA 480 

TTCTTTA 487 
(2) INFORMATION FOR SEQ ID NO: 548:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 871 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 548:

TTGGTGGTGC AGCAGTTATA GCAATTGTTT TAGCATTCAT TGGTAAGTTC ACTGCATTAA 60 

TTTCTTCTAT ACCTACmCCA GTTATGGGAG GAGTATCTAT ATTACTTTTC GGTATTATTG 120 

CAGCAAGTGG CTTAAGAATG TTAGTTGAAA GCAAAGTAGA TTTTGCGAAC AATCGAAATT 180 

TAGTTATAGC TTCTGTAATT TTAGTTGTAG GTATCGGTAA TTTAGTATTT AACTTAAAAG 240 

AAATTGGTAT CAACCTTCAA ATTGAGGGGA TGGCATTAGC TGCACTTTCA GGAATTATTT 300 

TGAACTTAAT CTTACCTAAA GAGAAAAAAC AAAACAATTA AGATTTACAA ATTAAGGAGG 360 

GCGCTTTTAT GAATCATTTA TTATCAATGG AACATTTATC TACAGATCAA ATATACAAAC 420 

TTATCCAAAA GGCAAGTCAA TTTAAATCTG GTGAACGTCA ACTACCAAAC TTTGAAGGGA 480

AaTATGTCGC AAATTTATTC TTTGAAAATT CTACTCGrAC AAAATGTAGT TTTGAAATGG 540 

CAGAACTTAA GCTAGGGTTA AAAACGATTA GCTTTGAAAC ATCAACATCA TCTGTTTCAA 600 
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TCATTAGACA TCCGTTTAAT AACTACTATG AAAAATTAGC GAATATTAAC ATCCCAATTO 720 

CGAATGCTGG TGATGGTAGT GGACAACATC CAACACAAAG TTTACTTGAT TTAATGACGA 780 

TATATGAAGA ATATGGATAT TTTGAAGGCT TGAATGTATT GATTTGTGGA GAcATTaAAA 840 

ATTCACGTGT OGCACtAGTA ATTACCAnAG T 871 



(2) INFORMATION FOR SEQ ID NO: 549:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 549:

TCAATCTAAT ATATACTTCA TGACTTCCCO CCAAAGGCGC AATCGTAGGA TTAGTCTGTT 60
TATCGATAAG ATCTATTAAT ATTGTTTCTA CTTTAGATTC ACCTATTCCC GCAAATCTTA 120 

ATAGTTCAGA ATGTATAATT CGATTATGGT TTATAAAATG TGACAACAAT TCATTTTTCA 180 


CCATTGGTTG CATTTCTTTC GGTGGACCTG GTAATAAAAT AATTTGTTTG TTTTCAAAAT 240 

TCACCATCAT TCCTGGAGCC ATGCCATGAT GATTTGTTAA TACAGTTGAA CCTTCAATTA 300 

CTAAAGCCTG TTGTcTATTA TTAGGTGTCA TTTCTtGTCC TTGTTCCTCA aAAwAGCTTT 360 


CAATATATTG AAAGAAGGCT CATCAATAAC TAAATCTnTA 400 
(2) INFORMATION FOR SEQ ID NO: 550:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1523 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 550: 
ATACCTCCAG CTAGAATACC AGCGTATTTC ATAAAATACT TCCTCCATTC AACTATATCT 60 


ATATTTAATT ATTTAAATTT CGTTGCATTT TCCAATTGAA AACTCATTTT AAAATCAAAA 120 
CTCTAAATGT CTGTGTATTA CTTAAAATTA TACATATTTT GCTTATATTT TAGCATATTT 180 
TGTTTAAACC TATATTACAT TATATCAGAC GTTTTCATAC ACAAATAATA ACATACAAGC 240 


AAACATTTCG TTTATTATTT ATATCACTTA ACTAATTAAT TTATAATTTT TTATTGTTTT 300 
TAAGTTATCA CTTAAAAATC GTTTGGCaAA TTCGTTGTGA CGCTTGTCCA TCTTCTAATG 360 
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10 



TTTGTTTTAA TviCATCAATG AGTGu~i\* i~a~X" . GATTTTCAAC AATTGGACCT GGCAACTCTT
TTTTATAATC CATGTAAAAA CCTCTAAGCT v-A I Ajvrt- t-A 1 A IViAlAluLiiX AGAAAATTTo
CGGACGCTTT AATACACCGA Au 1 (JtiAAVJAi GACAGATGAG TAGTCGGTAA
CTAACGCATC GCTGAi lAAG TATAAATCCG AAATGTCTTC ATAAtCTGAA ACGTCTTTCA 660
CAAAATCATC ATGTTCATCA ATACGTGTCA CAACTAAATA ATGCATGCGT AAkAAAATAA 720
CATArTCATC ATCCAGCGCT TGACGCAAAG CTTCTATATC AAAGTTAACA TTAAATTGAT 780
ATGAACCTTC TCGAATCGCT TCATCGTCAC GCGAAGTTGG CGCGTACATA ATCACTTTTT 840
TATCTAATGG AATATTTAAT CTTGTCTTAA TACCATTAAT ATATTCAGTA TCATTGCGTT 900
TATGTGATAA TTTATCATTT CTTGGATAAC CTGTTTCCAA AATCTTATCT CGACTAACAT 960
GAAATGCATT TTGAAATATC GATGTCGAAT ATGGATTAGG TGACACTAGA TAATCCCACC 1020
GTTGGCTTTC TTTTTTAAAG CCATCTTGGT AATTTTGAGT ATTTGTTCCT AGCATTTTAA 1080
CQTTACTAAT ATCCAAACCA ATcrrrrrrA ATGGCGTGCC ATGCCATGTT TGTAAGTACG 1140
TCGTTCGCGG TGATTTATAT AACCAATCTG GTGTACGTGT GTTAATCATC CwCGCTTTCG 1200
CTCTTGGCAT CGCTAAAAAC CATTTCATTG AAAACTTTGT AACATATGGT ACATTGTGCT 1260
GTTGGAATAT GTGTTCATAT ccttitttca CACCCCATAT TAATTGGGCA TCGCTATGTT 1320
CAGTTAAGTA TTCATATAAT GCTTTGGGGT TGTCGCTGTA TTGTTTACCA TGAAAGCTTT 1380
CAAAATAAAT TAGATTCTTG TTTGGCAATT TTGATAGTAA TTTAAAAGTC GTATATATAC 1440
TATGTTCTAT caatttttta ATTGTATTTT TAATCATGTC GTACCTCCGA CGTGTTTTTG 1500
TAATTATATT AATATGTATG AGC 1523



(2) INFORMATION FOR SEQ ID NO: 551: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4923 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 551: 
CAGTAAGAGA TTTTCTTAAT TGAAAATAAT CTTACTGCTT TTTTAAATTT AATTTCGAGA 60 
TTCAATATTA GTTTATCTCA TTGTGGCATT AATTGATAAA ATTGTTTTAA TGTTATAAAT 120
CAAAGTCtTC TTCAACAATT TCAATGTCTG CATCAGATCT ATGATATGTG AAAAAGCTAA 180 
TTCTTATGCG GTCTAAATGC TCTAAATGGT GTCGATACTC TTCGATGGCT GCAACAATTT 240 

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15 






ATGTGGACAA TAAATCTTTT TTCTGCGGAT TATCTAACTC ATAATCAACA TGTGTCACAT 360
TATAACGTGC TTTTTTAGAA AGGCTAGCTA AAATTTGCTC GTGAAAAGCA GTTAATGAAT 420
CTAAATCTAA TTTGATTTGT AATAGGAAAT TGTTATTAAG TAAATATAAG TCGTTTTGAT 480
AACGCGACAA TTTGTTTAAT ACTTCATAAG CTTGTCTAGT CGTCTGAACT ACTTCTCTGA 540
AAAGTATTTT CTTTCTATTC TGTTGGTGAA TATGTTTTTT TGTAATAGGA CGTTCTTCGC 600
TATAGTAATC ATAAATTTTC TCTAACTTTT CGACACGTTG TTTTAAATTA TGACTATCTT 660
GTTTAATATT ATTAAACTCC GTCGTATCAT TTAATACTAA TTTAAACCAC ATAAAAATAT 720
CTGAGGATAT ATTTAATGAA TTATAGTAAA TTTTTGTTTC AAATTTAGGT GGTAGAAACA 780
CAAAOTTAAC TAGAGATGAA CTTATGACAC CAATCATTAC AAGTACAAAC CTGTAAAAGG 840
CGGTAATATA GAAAGAACCG GTATGTTGTC CCATAATGAT TAATGCTGTT ACACTCGCCA 900
AAGTAGCAAC ATGTGCTAAA TTAAATTTAA ATAAAATAGC AATAAGTACT ATGACGGTAA 960
CACCCATAAT GATAAAATTA TCACTAAAAA TTGTTACCAT TGTAACAGAT AGTATGGCAC 1020
CTATAATGTT ACCCAATGCT TGATCAGAAA CTGTTTTAAA TGAACGATAA ATACTAGGTT 1080
GCATTGCACA ACAGCACTGA CACCAGCTAA GGCTTTCAGA CCAACATCAT CCGGTAGTAA 1140
AGAAGCGATA GACATAGCTA AAATAATGGC TATACCAGTT TTAAAAATCC GAGCTCCTAG 1200
TCTCAAAAAT AATGACGCCC CTTTTAAGTT TATTGAATAT CTAATATTCG TATTCATTAC 1260
TGTTATACAC TTACTAGTTA CAAAATTCAA GCTTATTTAT AGTTGTTAAA ATAAATCATA 1320
CATAATACTG ATAGCGATGT AAAACTTTAG TCAGAGATTA AAATAGTATA AATTTGTAAA 1380
ATAAAAACTC ACATAGTGAC ATATCAAGTT AAACGTTAAT AGTTAACGAT ATAAAATGAA 1440
TCTACTATGT GAGCATTTGC TTTATTTTAA TTCAATTAAA AATATACTTC CTTAAAAGTT 1500
ATTTCATTTG ACTAAAAGCA TAGTCTGCAG CTTTTAAAGT TTGTTTAATA TCTTCTTCTG 1560
TATGTTCAGT TGTTAAGAAC CAAGCTTCAA ACTTAGAAGG TGCTAAATTG ATACCTTGAT 1620
TTAACATTAA TTTGAAAAAT TTACCGAACG CTTCGCCGTC AGAATGTTCA ACTTGATCAT 1680
AATGTGTGAC TTTTTCATCT GTAAAGTACA ATGTTAAAGA TCCATAAATA CGATTAATTG 1740
TAGCTGTGAT ATTATGTTTT TCGATTAATT TAAGTAAACC TTCTTCTAGT TGTTGGCCTA 1800
AGCTGTCTAA TTTTTCATAA ACACCGTCTT GTTCTAGTAC TTCGAGTAAT GCAATACCTG 1860
CTTTCATAGA TAACGGGTTA CCAGCCATTG TACCAGCTTG ATATGCAGGT CCTAGAGGTG 1920
CTACTTGTTC CATAATATCT TGACGTCCAC CATAGCCTCC AATTGGTAAA CCACCGCCAA 1980
CAATTTTACC AAATGCAGTT AAATCAGGGA TAACACCTAA TAAATCTTGA GCGGCACCGT 2040
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AAATTTCATT AACCTCTTCT AAAAATCCAG GTTGAGGCAT TACCATTCCA AAGTTACCAA 2160 

CAATTGGTTC TACTAATACT GCGGCAATTT CATCACCCCA AAATTCAATT GCTTCTTTAT 2220 

AGGCGTTAAT ATCATTGAAA GGTACAGTAA TGACTTCACG TGCGACGCTT TCTGGAACAC 2280 

CAGCTGAOTC TGOAGAACCG AGCTGAGATG GGCCGCTACC TGCTGCAACC AATACTAAAT 2340 

CAGAATGGCC ATGATAAGAT CCAGCAAATT TTATAATTTT ATTTCTTTTA OTATATGCAC 2400 

GTGCAACACG AATTGTTGTC ATGACTGCTT CTGTTCCAGA ATTTACAAAG CGAATTTTCT 2460 

CAAGAGATGG AATTGCATCA CGTAATTTTT TGCTGAATTC AATTTCTAAT TCAGTCGGTG 2520 

TACCAAATAA AACACCTTTA GCAGCTTGTT CTTGAATTGC TTTAGTAATA TGAGGATGTG 2580

CATGCCCCGT AATAATTGGA CCGTATGCTT GAAGGTAATC AATAAATTTA TTGCCATCGA 2640

CATCATATAA ATATGCACCG TGTCCTTCTT TCATAACAAC AGGTGCACCG CCTCCTACAG 2700

CTTTATAAGA ACGAGAAGGG GAATTOACAC CGCCTAGAAT ATATTCGTTT GAAAGTTGTT 2760

GTAAACGTTC ACTTTCACTA AAATTCATTT ATATCAACCT CTTTTAATTT AATATTTTCA 2820

TCTAATATCG TATCATAAAA TTATTATAAT GAAGAAAAAG GTGATTATAT GTTGCAAAAA 2880

GGAOAACAAT TTCCAATATT TAAATTAGAA AATCAAGACG GAACTGTCAT TACAAATGAT 2940 

ACATTAAAAG GTAAAAAGGC GATTATATAT TTTTATCCTA GAGATAATAC ACCTACTTGT 3000 

ACCACAGAAG CTTGTGACTT TAGAGACAAT TTAGAAATGT TCAATGATTT AGATGTTGCA 3060 

GTATATGGTA TAAGCGGTGA TTCAAAGAAA AAACACCAAA ATTTTATTGA GAAACACGGA 3120 

TTGAATTTCG ATTTATTAGT AGATGAAGAT TTTAAATTAG CTAAAGAAAt GGCGTATATC 3180 

AGTTAAAAAA ATCATTTGGC AAAGAAAGTA TGGGCATTGT AAGAACGACT TTTATAATAG 3240

ATGAACAAGG TAAAGTATTA GATGTTATCG AGAAGGTTAA GGTTAAAACA CAAATAGAAG 3300 

AACTTAAAAA CATTTTGGGG TGACATATAT GAAAGTTGTT GGGTTAAATC GTATGCGTGA 3360 

AGTTGAAACT GAATTACAAC AACGCTTTTC AGATTTAGAT TTTAAATTTT ATAAAAAAGC 3420

ATCAGAAATA CCTGAGAGCG ACTTGGCTGA TTTAGATATA TTAGTTGGTT ATGATGGCGG 3480 

TATCAATGAG GCATTTTTAC GACGTTGCCC GAATTTAAAA TGGATTGCAT GGTTTGCAAC 3540 

GGGTGTAAAT ACATTGCCGT TAGATTATAT TGCAGATCAC GGCATACTTT TAACTAATGG 3600 

AAAAGGTGTT CAAGCTAAAC AATTATCTGA ATACATTTTA GCTTTCATTT TAGATGATTA 3660 

TAAAAAGATG AAACTATCAT ATGATAACCA ACGACAACAT ATATATGATT CGAAAATAAC 3720 

TGGTAAACGC CTATCAGGAC AAACAGTTTT ATTTTTAGGT ACAGGTGCAA TTGCTACTAG 3780 

AACTGCGAAG TTAGCAAAGG CTTTTAATAT GAATTTAATT GGTCTGAGCA AGTCAGGTCA 3840
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TGCTGACATT ATTATAAATG CTTTACCAOA AACGCAAGAA ACGATTCaTT TaCTAAAGAA 3960 

AAAACATTTT GAATTAATGA AAGATGAAGC ACTTTTTATA AATATAGGAC OAGGTAGCAT 4020 

AGTTAAAGAA GCGCTCTTAA TAGAAGTATT AAAAAGTAAA GTTATTCGAC ATGCATATTT 4080 

AGATGTGTTT GAAAATGAAC CTTTGAAACC TAATCATGAA TTATATGAAT TGGATAATGT 4140 

AACTATAACA GCGCATATAA CTGGTAATGA TTATGAAGCA AAGTATGACT TATTAGATAT 4200 

TTTTAAAAAC AATCTAGTTA ATTTTCTCAA TAAGAATGGT CTAATTGAGA ATGAAGTTGa 4260 

TGCTAAAAAA GGCTATTAAA TGArATCATC ATGTAAATAT TGACACGCGC GCAATACTAC 4320

AGTTATATTT aTAGTAAgTt AATaATgATT ATATAAGAaA GATGGTgATA TAGATGAGTG 4380

TTGAAATAGA ATCAATTGAA CATGAACTAG AAGAATCAAT TGCATCATTG CGACAAGcAG 4440

GCGTAAGAAT TACACCTCAA AGACAAGCAA TATTACGTTa TTTaATTTCT TCACATACTs 4500

ATCCAACA3C TGaTGaAATT TATCAAGCAC TTTCACCTGa TTTTCCAAAT ATAAGTGTTG 4560

CGACAATATA TAATAACTTA AGAGTGTTTA AAGATATTGG AATTGTAAAA GAATTAACAT 4620

ATGGAGACTC ATCAAGTCGA TTCGACTTTa ATACACATAA TCATTATCAT ATTATATGTG 4680

AACAATGTGG TAAGATTGTT GATTTTCAAT ATCCACAGTT AAATGAAATT GAAAGATTAG 4740

CTCAGCATAT GACTGACTTT GACGTAACAC ATCATCGAAT GGAAATTTAT GGAGTTTGTA 4800

AAGAATGCCA AGATAAATAA TTTAACTTTG GTAGTATGAC AAATTAAAAA AGCGTTACTW 4860

ACTTCATATA AGTAAGCGTA ATATTTAAGA nGTTAAACGA CATGaAAGTt GTTTAACTTT 4920

TTT 4923
(2) INFORMATION FOR SEQ ID NO: 552:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 917 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 552:

TCCGGCTTTA AAAACTTTTC CCAATTCCAG CTTGGGCCTT TGGCATTAAT ATTAACCTCC 60 

TGGTTCGGAT TAATTGGGAA CCTAACCnTT TAGGCAATAA TTGGTTTAGG CAACTTCCAA 120 

ATTGGTGGTT CAACCAACGT CTTTGGATAC CyTGcTCATT TAATTCTAAA ATGgTyrGAA 180 

CGCATTTTGG TACCCAAAAt GgTGACGTTC GTTTGCACGG TCTAATAAAT TGTCTAAGTT 240

GTCGATTGGT TTCATTAATT GAaCACCATT TTGCTCTTCA GCAAGACCTT TGTCTACTTT 300
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GCGATCCATT GTTTGTTCGA AAAGAATAGC ACCCAAAATT TTATCTGGTG AGAATGAAGG 420 

TGAAGTTACC ACACGTGTAC GCATATCGTG AACAAGTTGG AACATTTCGT CTTCATTGCT 480 

ATATTGATCT TCGTTTACAC CATATTCTTT AAGTGCTTTT GGTGTACTAC CACCACTTTG 540 

GTCTAATGCG GCAATAAAGC CTTTTCCATT TTTCATTTTT TCTAATTGCT CTTTATTCAT 600 

ACTTTCCACT CCTTAACTTT TCAATACACC TCCAGTATGA TAAAAATGAG AACATTTCTC 660 

AAGTCATAAA CCTTGAAAAG TGTATAAAAT GTGAAAAATA ATTGTCAGTT TAATTAAAAA 720 

TATTATTTTA TTCTAGGTAT GACTAACGCC ATTAATGACA TAAAGAAAAT ATGTGTAATA 780 

ATCCAACCGA TTAATTCTGT CACACTAAAT TGAAAAATTG GACGTTGCGC AATAAATACT 840

AAAAAGGGAT ACAATGCTAT AAATAAGAAA AATAAAGGGA TATAACATAG ATAGTAAAGC 900 

CTTTTAGAAG TATGAAA 917 
(2) INFORMATION FOR SEQ ID NO: 553:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 432 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 553: 

ACTCCCATAT CCACTGGCAA GATTTATAAT TTGTAATTCT AAAGTATAAT AGCCTATATT 60 

TTTCAAAATC TCTTTTTTTA TGATTAACAT CCCTCTAAGT GCACCTAAAG GTTTTTATGT 120 

AATGTATTAA ATATTTCCAT TTATACATAT ACATCTTGTA TAAAAAGAAA GAACTCCATA 180

TATACTCAAA AGGTATACTG AAGCTCTATA TATTATATAA AATTATTATA CTATTTTGAA 240

AAACATAAGT AAATCACTAA ACATGATTTC TTTCACTTTT AGAAAACTTT TAATACTATA 300

AAAGCACCCA CTCAGTCACT AGTTTGGGCA GTTATTGTAT GCCTATTGaA CTCAATGCgT 360

ATATTACAAT ACCTTTTtCG CATATTCATA TAAGacTTTG CATCTTTAAG CTTAATTGCT 420 

ATCTCTTTCT CT 432 

(2) INFORMATION FOR SEQ ID NO: 554:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1374 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear





TTTTGCTTTA TGTTAAATGG ATTATTaTTA GAATAAAAAA TCGGTGATGA GCTAAAAAAG 60
TGTGTAGGAT GTTTTCCkAA CCCAATTTTT ACATCCGAAG ATATCGAACA ATATCTTCCT 120
ACTTCTACAT TATTAAAATC ACTACCAAAT CCAATATAAC TGTATTCACC AATGTGAGAA 180
TTCCTGATTT TACACCATCT ATCTATATAG TTATTGCCAT CAAATTTTGA GTTTGTAATA 240
TACGCCAAGC GATGAATCTT AACATTCGAT TCTTTAGAGG ACTGGTTTTT CAGCAAACCA 300
ATTATCTTTT CAATCGCTAT CCTCATCGTC ATTTCCTCCA AGTATTTCGA TTGATAATAT 360
CTTTATAGCT TTGAATAATT TTAACTACCT TTGTCGAAAC GTTAGTGTCT TTATAATCAA 420
TAGCATCAAT CATCGGTTCG TTATTGTTTT GCATCTCTCT TGCTAGTTCA ACGGATTGGA 480
TTAGATTGTT ATAGGTAATA CCACCTACAA TAACOGTACC TTTATCTAGT ACTTCCGGTC 540
TTTCTGTGQA AGTTCGAATA AGGACACCAG GGAACTTCAA AATAGACGAC TCTTCTGACA 600
ATGTTCCACT ATCTGATAGC ACAACAAATG CATCTTTTTG CAATGCATTA TAATCAAAGA 660
AACCAAATGG CTTTAACTGT TTAACTAATG GATCAAATTC AAATTTACTT TCTTCAATTT 720
TCTTCCAACT TCTTGGATGC GTTGAATAAA TCACAGGCAT TTTATACTTT TTGGCAATAT 780
CATTTATCGC ATTCATTAAT GATTTAAAAT TCTTTTCATT ATCGATATTC TCTTCTCTAT 840
GCGCAGaTAC TAAAATGTAT TGTTGCGGTT CTAATCCTAG TTTATTTAAA ACGTCACTGT 900
GATTAATTTT ATCTCGATGC GCTTCTATCA CTTCTGTCAT CGGTGATCCT GTyACAAAGA 960
TATTCGCTTT ATTGAAGCCT TCATCTAATA AATAACGTCT GCTATGTTCC GTATAAGGTA 1020
GATTCACATC ACTGACATGG TCAACAATTT TACGATTGAT TTCTTCAGGT ACATTCTGAT 1080
CAAAGCATCT ATTACCCGCT TCCATGTGGA ACACAGGAAT CTTTAATCGT TTAGCAGATA 1140
CTGCTGCTAA ACAACTATTT GTATCACCAA GAATTAAAAG TGCATCTGGT TGTTCGCGTA 1200
ATAAAACATC ATATGTCTTC GCAATAATAT TCcccAtcxrr TTCTCCAnGt TACTTCCAAC 1260
TGCCTCTAAG TAGTGGTCCG GTTGTCTTAA TTCCAAATCA TCAAAGAAAA TTTGATTCAA 1320
TGTATAATCA TAATTTTGAC CAGTGTGTAC TAATATCTGA TTAnAAATAT TGAT 1374



(2) INFORMATION FOR SEQ ID NO: 555: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1472 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 555: 
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X X V_MWV7V? Ill 


GCTTATGATA ACGTTATCGA 


W\X XV>X lJ"\r\f\ 


X * u 




A A X JL A A X 


TGAAAAAATA 


f\- X X X X Wl 


GATAGACGCT 


ACTTGGaTTT 


WO X VarCIU XXX VJ 


180 


5 


TAAtAACGCA 


G t ATAA tTAC 


CaTTAPTAAA 

V_0 X X X #*WU% 


tTGaGTTAAT 


CGTTCaTATA 


IV 2l(W IV PT 2V2V T 
nnu X /tv inn x 






AGGATAATAA 


ATGTTATCCG 




CGTATAGTTC 


ACTTGATACG 


XXX V^VJV^V- XXV* 




10 


AJVr'AATTfjrT 

/WVw^W% X 1 W V X 




Vll 1/W\1 J"lX X 


AGTCATCATA 


ACCTCAGAAG 


2V TTTA IV C* IV IV IV 
nl l\>iviwvv\ 


■JCA 


iunnnnAlun 


1AU 1 1 1 vAIa 




TTGATGTTCA 


TATGTTGAAT 




*1 "5 A 




CGCTTTTTCA 


AAACTATAAG 


CTGC7l*GCATA 


AATATCATCT 


TTAGCTAATG 


AATGTGTACA 




TTAAAATACT TTGCCGCTTC GTAACTTAAA TATAACGAGA CATATCTACC 540
TTGTCGTKGt GCGCTTGTGC AAAGTGTATC ACTTCTCCCA CATCAGCCAA CTTAGTAGCA 600
ACATACTTCT TrTATAAATCC CTTTAATTGA ATATGGTATT GCTTTATATTC ATTTTCAGTT 660
AAATAGTAGC GATAATTATA TTCTATTCTC ACAGTAATCA CCTACCTTCG ATAAAAATAA 720
TTCAACTTGT CGATAACCGT ATTCACTCAA AATAGATTCA GGATGATATT GCACACCAAA 780
AACCGGAAAT CTAATATGCT CAAATGCCAT AATAATCGCT TCATCGTTTT TTGCTGTAAT 840
CTTTAAGCAA TTTGGAAAAG TCGCTCCGTC AGCAATTAAT GAATGATAAC GCATTACATT 900
GAAATTTTGA GGCAGTCCTT GAAAAATACC TTCATTGGTA TGGCGTAACT GTGTAGTATG 960
TCCGTGTACA GGATGATAGC CGTGAATGAT ATTTCCACCA AAATAAGACA CGATACATTG 1020
AAATCCTAAA CATACACCTA GTATAGGTAC ACGCTGATAA AATTGTTCTA ACACTTCATT 1080
CAAGATAGGA TAATCATCCG GaTTACCCGG CCCAGGCGAA ATAACAATTG c~i"J.~±~xGGCTT 1140
CATATTAATG ACGTCTTCTA TCAGCAGATT ATGAATACCA ACAACTTGAA CTGTTAGTTT 1200
CGTTTGAGTC TTAATATAGT CTATTAAATT ATATGTAAAT GaATCATTAT TATCTATGAC 1260
TAGAATCATT GTATACTCCG TTCTAAATGT GTTTTATTTT TATAATATCT ATTGGATGTA 1320
GCTAAAACTT TAAAAGCATT GTCATTATCC TGACCTTGAT TTTAACTAAT ATATGGTATA 1380
TTCTATTCAT CGTACATAAA TGAATATCAG AGGTTCCTAG CTGAAACCCT CTATAAAAAA 1440
CTAGGCCATT GAAATTTCAA ACATTCGTTG GG 1472



(2) INFORMATION FOR SEQ ID NO: 556: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1054 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 
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is 



AGAACAGCAA GGATTACTTA CTGAGGAATT AAAOAAGGAT ATTTTAAAAC AGAACAAATT 60 

ACAACGTGTT GAAGACCTAT ATAGGCCTTT TAAACAAAAG AAAAAGACAA GGGCAACTGA 120 

GGCGAAACGT AAAGGGTTAG AGCCATTAGC GATATGGATG AAGGCACGTA AACATGAAGT 180 

CTCAATTGAA GAAAAAGCAC AACAATTTAT AAATGAAGAA GTGCAATCGG TTGAAGATGC 240 

TATCAAAGGT GCACAAGATA TTATTGCGGA AGAAATTTGA GATAATCCTA AATATAGAAC 300 

AAAAATTTTA AAAGATATGT ATCATCAAGG TGTGTTAACT AGATCTAAAA AGAAAAATGC 360 

TGAAGATGAA AAAGGTATTT TTGAAATGTA CTATGCATAT AGTGAGCCAA TTAAACGCAT 420

TGCTAATCAT AGAGTTTTAG CTGTTAATCG TGGTGAAAAA GAGAAAGTAT TATCTGTAAA 480 

GTTTGAATTC GATACGACAT CAGTAGAGGA TTTCATTGCA CGTCAAGAAA TCAATCATAA 540 

TAATGTAAAT CGCAGTTATA TTTTAGAGGC GATTAAAGAT AGCTTGAAAC GCTTAATTGT 600 

CCCTTCGATA GAGCGTGAAA TCCATGCTGA TTTAACTGAA AAAGCTGAAA ATCATGCAAT 660

AGATGTTTTT AGTGAAAACT TAAGAAATCT ATTACTGCAA CCTCCAATGA AAGGTAAACA 720

AATATTAGGC GTAGATCCAG CATTTAGAAC AGGTTGTAAA TTAGCAGTCA TTAACCCATT 780

CGGTACTTTT ATAGCAAAAG GTGTGATTTA TCCGCATCCA CCAGTTTCTA AAAAAGAGGC 840

AGCAGAGAAG GATTTTGTAC AAATGGTTAA AGCGTATGAT GTGCAATTAA TTGCAATTGG 900

CAATGGTACT GCAAGTCGTG AAACAGAACA ATTTGTTGCA GATTTAATTA AAAAGCATCA 960

GTTGCCAGTA CAATTCATCA TTGTCAATGA AGCGGGCGCT TCAGTATACT CAGCATCAGA 1020

AATTGGTAGA GATGAATTTC CTGATTTTCA AGTG 1054
(2) INFORMATION FOR SEQ ID NO: 557:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1057 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 557:

AAATGTCAGA ATACAAGAAA AAAATAATTG AATTAATTGA AAGTAATTTA ACAGGATATG 60 

AAATTTCTAA AAAAACTGGA GTTTCTCAAT ACGTACTTTC ACAATTAAGA CAGGGCAAAC 120 

GCGAAGTAGA TAATCTAACC CTGAATACAA CAGAAAAATT ATATGAATAT GCCAATAAAG 180 

TTTTGTAATT TAACTAATGT ATAAATTAAT CAAGCTATGT TTATTTGATT TAACTATTAA 240 

TAAAAATCAT ATGGTGAATG GATATTATAA TAATTAAAAT ACAAAAATAG TAGATTCCAA 300 
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AAGGGAAAAT AAGTGTTAAG TTTTAAATGA TAAAAAAGAT TGGAATGGAT CGTCTTGAAA 420 

TQCTCCCTTC AAAOTTTTCA TTTTTTCAAT GTCGACTTCG AAGGGGGCAT TTTCATTAAA 480 

TTGTTATAGC TTTTTATATT TGTATAATGA ACATATAAGT TTAAGAAGGT GCGAGTGAAG 540 

GAAATAAAAA AGCTCAAATG TACCAAATTG TTAATCTTAA TAAATCTCTA CTTTATAAAG 600 

ATTGAATGGA CATTCGAGCG TTAATCAGTC AGGAGGGACT TTCCCTCCTA CAATTTAATA 660 

ATAATACTTG CTTCACCACT ATACAAGOAG TGAGTTGTTA TGTTCAAAGT GAATTATTOG 720 

ATTTTAAGTT ATTATCCAGA ATATAATATC GCAGTAAGTT GGCAACGTTT AAGAGAAGGA 780 

AAAACAATAA AAAACAAGAT TTAATACTGC TGCGTCATGA GGCGCTTGAA CATTATTTGA 840

TGAATAAGTA TAATTTCAAC TATGATTATG CACATAAAAT TGTATCAAAA AAATACGATT 900

ATTCAATTTT TATAAAAaAG AAGGTGGATT AAATGCTTAC ATTAATAAAA TTGGAAAGAA 960

GATGaACAGG tTATAATATA TGraTATATa CCTGaAGATG aTATAAGTAC CGGGTAAAGG 1020

GTCCCGTACC TTTTTAATTA AAAAAGTTCC AGGGGGT 1057 
(2) INFORMATION FOR SEQ ID NO: 558: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3754 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 558: 

CTGTGCTGTA TTTACtTTAA TTTGACGACA TTTGAGAAGT ATTATGATGG nTGTAGTTAT 60

TTTATGAAAG TAATGTATTA ACAATCGAAT TACTAAGTCT ATATTGGTAG GAATATCGAT 120

GTTTAGTTTA AATGGAATGC ACTATTTAAG TTTTTAATAT GGAGATGTTT GTGACTTTTC 180

GATGATTAAG ATTTTTATAG GTGTGCATCA TTTCAAATAA ACTTTGTGTT TAAAATTGAG 240

CTTAGGAAAT CGATAGGTTT AGATGAGGAT ATTGTTGAAG TTATGTGTCT TGTATCCTTA 300

GTTGTTATAA AAGCGCAAAA AATAGCACCG CTTTCTCTTT ATCTGTGTAG AAAGGATGCT 360

ATTATTGTAA AACAATAGTT TTAATTTnAT TTTCTGATaT ATCATATGTr ATTcTACCTG 420

TATCAATTTC TATCGAATTA TAACCATCAA AATTATCAAC TTTATCATCA AAGTCTATCA 480

CTTTTCCAAT TAATATTTTA TTATTAGTAA GCGTTAATTT GACTAATTTG CCTATGTATG 540

ACTGTAAATT CATATTTAAT CACTCCTTTT TAATATACGG AACTACATGA AAACCAGTTT 600 

TAGAATAATG AACCTTACCT AATTTCGTTT CAATATATTT ACCATTCACA TAAGATTTAC 660 
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AAACTGGAAT ATCAATAAAT TCTTTTTTAG TCATCTTTTC ACTTTCATTA GAATCTATTA 780
TAGTATAACT TGGTAATAAA GATGAATTAT TTTTCTTATT ATTATATTCA TTTGTGCATA 840
AAATATGACG CTTTTGCTTA ACATTATTCA ATTTCATCGT CATTTTACCA TTACTTATCA 900
TTTCAAGCAT TTCTTTTTTA GCTTTTTGTA GAAAGGATGC TATTTTGTAA TTATTTTAAT 960
AATCTAATAC TTTTTATCTC AGTTTCATCA AACGAATAAA CAGCAAAATC AGTCTGTATA 1020
TCTAGAACAA GATTTCCTGT TTCGCTCTCG AATTCATTTT CATAGTCAGT TACAAACCCT 1080
TCGTATTCTT CATTATTCAA AAGCGTGATA ATAACATCTT TACGATATGC ATCTnCAATT 1140
CTCAATACTT TTCACCTACT TATCAATATA AGGTACTATA TGaGcACCTG GTCTTCGAAT 1200
AATGGCACTT CCCTTTCTGG TTTCAATATA CACATTTTCG ATATGTATTT TTCCAATAAT 1260
TTGATTAAAA TTAATAATCT CTTTCAAATC AAATCGCTCA TCACTTAATA TTAGATTGCC 1320
TGTTQACATT TTTTCTCTTA ACAATTCATT CAATAAATCT ATAGAAAGTA TTGTATAGCT 1380
AGGCAATTTC TTATTATTTA AAATGGCTCT TTTTTTATTT TCATTATATA GGTGATGACC 1440
TAACATATAT CTATTTTATT TTCCGCAATT TATTTCTATT TTTATTTTAC CATTTTTAAT 1500
CATTTCTTTC ATCTGGTTTT TAGCTTTTtC CTGtAATTAT GCTTCTTTTA CTTCTACTTG 1560
ATATTTACCt TCACGCTCTT TAAAGAACTT GTCCCGCCAA TTGCCAACAT GTGGCACTGT 1620
GGTACTTCTA CACCAAGGAT GCATAGGTGG CGCATTCACA CCTGGTATCA TATCTTTAAC 1680
TTTAAATATT TTTCOGTTAA GTGAATGACA TAATTTAGAT GTTTTACTAT CTATTTTGGC 1740
AACATATTTA TATTOGCCAT CTTCACCAAG TTCTTTTAAA TATGTTAACT TTTGTGCTTC 1800
TGCATTTTCA GTAAATAGTT AAAAAAGCGT ATAAAAATAG CACCACTTTC TCTTTAkCTG 1860
TCTAAAAAGG ATGCTATTTA TCTTTTGAAT TTGAATTCTT TTTCGCTTTT TCTATACTTT 1920
CAAATTCTTC AACTAATTCT TTAAAAGATT CACTCAATTC TTTTGCAGTT ACATTTCCAT 1980
CTAATTGTGA ATCTAACATA ATTAAAATCA TCTCACTTTA TATTTAATCa TATTTATACT 2040
ATAAAGTTTT TTCAATAATT TTTCAATATG GCTATCATTA ATGATATTAA TATGCGTAAA 2100
ATATTTAGCA CAAAATTTAC TCACTATTTT ACCATGAAAC CTATTTGACT TGGTAATAAA 2160


45 


TTTTACTTGT 


CCCTTATTAG 


TAACGATTGT 


CATTGATTTT 


ATTGATGGAT 


GCTTAAAAAA 


2220 




TGTAAATAAA 


TCATATTCTG 


AAAATCCTGA 


CTGTCCAGGA 


TGGTTATGTA 


ACATAACAAT 


2280 


SO 


TGAATTCGGT 


TTACTGTTAA 


ATAATAATTC 


GGTTGCTTGT 


TACCCTGGCA 


CAAAAGATAC 


2340 


ACTATCTTGA 


TTGACATATA 


CTTTTGTAAA 


TTTACCATCT 


TTTAACAAAT 


AAGCTACTTC 


2400 




ATTGCTATCA 


TTGTTTTCCA 


TGAGATATAC 


CTCCGTTTAT 


AG t CTGTCG A 


CTGATATTCC 


2460 
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s 



GAAGGTTAAA ATTTTTAAAG TAAACTTTCT TTATCCCCAT GCTACGAGTT CAGATTCAGG 2580

AAATAGCCCT TTACTAGTAT TTATGTATAT TCTGTCTATG GCATGAATAA AATAATTATC 2640

TCTTGTATTT tTTTCTAAAC TAGATTTTTC AGCATTGATA ACTTCAAGAC TATCTATATC 2700

CATTTGAATA ATACCAGGCT TAATATTTTC ATCATTATTA GGAAAATATT TATATGTAAC 2760

ACTTTTATCA TTAATTTCTT TTATTTTTAA TATTAGCAAT CATTTCCACC TCTAATTAAT 2820

TAAAATACTA TAATTATATT TTATTTCTGT AAGTTTATGT GCCTCTATAT AGTGTAAATT 2880

ATACTTATTC ATTAGATAGT GTTCAAGAGC TTCATGTTTC TACATTATTA TATCCATTTT 2940

TTTAATATTT TTCCCTTCTC TTAAACGTTG CCAACTTTGA GCCATATAAA AGTCAGGATC 3000

AAATTGTTTA AATCCACTTT CTAATAAATA CTTATTTTCA AATATATGTT CATAAACTCT 3060

TTGAATTAAA TTTTTATTTA TATTAgTATT TTTAGCAATT TTAGAAATCT CTATCTGTTT 3120

ATCTCGATTT CTAACTGAAT TATAATAAAT TTGAGCATGT CTGTTCCTTT TGATACCGTA 3180

TTCATCACTT TTATTATTAA GTGCACCTGA TTCAATAAAA CAACCTTCTA CTTGATATTT 3240


ACCTTCACGC 


TCTTTAAAGA 


ACTTGTCTCG 






CTGTGGTACT 


3300 


TCTACACCAA GGATGCATAG GTGGCGCATT CACACCTGGT ATCATATCTT TAACTTTAAA 3360

TATTTTTCCG TTGAGTGAAT GACATAATTT AGATGTTTTA CTATCTATTT TGGCAACATA 3420

TTTATATTCG CCATCTTCCA CCAAGTTCyT TTAAATATGT TAACTTTTGT GACTCtTCyT 3480

TTTCAACGAA TAATGAAAAA AGCATATAAA AATAGCATCG CTTTCTCTTT ATCTGTGTAG 3540

AAAGGATGCT TTAATACCaT GCTATTTTAT AATTTTcGGG AAATTCTTGC TTCTCGATAA 3600

AGTCTCTTAC TACAGAAAAA GACTTATTAC GATATAACAT AAAATATTCT TCATTTTCTA 3660

TTTCAGATAA TGAGAAATCT ATAACCTCTG CATCTTTTTT ATTAAAAGTT ACTGAACCTT 3720

TACCGTTACT TATATCATCT TCAGGTATAT ATTC 3754



(2) INFORMATION FOR SEQ ID NO: 559:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 815 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559: 


ATTTAACTAA AACTATAaAT AATCAAATGA TATTGGAAGA TATTaGCATA GATATCGAAA 60 
AAGGTAAATT GACTTCTTTA ATTGGACCTA ATGGTGCGGG TAAGAGTACT TTACTTTCAG 120 
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CTGATTATAA 


AAATAATGAC 


TTGTCGAAAA AAATATCTAT ATTAAAACAA ACAAACCATA 


240 




CTGAAATGAA 


TATTACGGTA 


GAGCAGTTOG 


X A\^%#^V-» 1 1 i WSJ 




TATTfTAAAG 


300 




GTCGTTTGAC 


GAAAGAGGAT 


CATGATATTG 


TCAA1Y3ATGC 


vjv» X rtvxrt X X X Vj 


x x wVJiinU ±.tw~ 


360 




AAGATATCAG 


AAATCGTAAT 


ATT A AHTC AT 


X/\ 1 UlVJUi 


ACAAOGTCAG 


CGTGCATACA 


420 




TTGCAATGAC 


AATAGCACAA 


un 1 At- 1 VsAA 1 


ATATTTTGCT 


AGATGAACCA 


TTAAATAATT 


4S0 


TAGATATGAA 


GCATGCTGTT 


UAAA 1 I A i vj 


AAAvJvj x I AAA 


AATGTTAGCG 


CATAAAATGA 


540 




ATAAAGCGAT TGTCATTGTG TTACATGATA TTAACTTTGC GTCCTGTTAT TCAGATCAGA 600

TTGTAGCATT GAAAAACGGA CAACTAGTTA AGTCAGATTT GAAAGATAAT GTCATTCAAA 660

GTAGTC5TTTT AAGTGATTTA TATGACATGA ATATTCAAAT TGAACATATA AGAAATCAAA 720

GGATTTGTTT ATATTTTAAG GATTGATAAT TTGGAGaCAC TTTAAAGGGG TGATGCGCCA 780

ATTAAAOAAG GGTTAAACGT AAAGCATTTA TTTAT 815




(2) INFORMATION FOR SEQ ID NO: 560:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 919 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 560:








GAAACGAATA ATAAATTTAC TGAGTTATTA GTTGAAAAAG CTAATAAACA TGATGATGTT 60

CTCGATAmGT TGATTAATAT TTTAAAATAA GCGATACACA CTACTAAAAT TGTATTATTA 120

TTATGTTnAA TTGACnCCTC CTAAATTTGC AAAGATAGCA ATTTAGGAGG CGTGTTTATT 180

TTTATTGACG TCTAACTCTA AAAGATATAA ATTAGACATT TACAAATGAT GTAAATAACG 240

CAATTTCTAT CATCGCTGAT AACAATTCAT GGTTTAATAT GCAATGAGCA TATACTTTTT 300

AAATAGTATT ATTCACTAGT TTTAACAATC AATTAATTGG TATATGATAC TTTTATTGGT 360

TATTTTTATC CCATAGTGTG ATAATTACTA TTTTTCATTC ATAATAAAGG TTTAAAGCAT 420

GTTAATAGTG TGTAAGATTA ACATGTACTG AAAAACATGT TTAACATAAT GATATAAGGA 480

GTGACGTACA TGATCCGTCT AGGTAAAATG TCAGATTTAG ATCAAATCTT AAATCTAGTA 540

GAAGAAGCAA AAGAATTAAT GAAAGAACAC GACAACGAGC AATGGGACGA TCAGTACCCA 600

CTTTTAGAAC ATTTTGAAGA AGATATTGCT AAAGATTATT TGTACGTATT AGAGOAAAAT 660

GACAAAATTT ATGGCTTTAT TGTTGTCGAC CAAGACCAAG CAGAATGGTA yGnyGACATT 720
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TATAAAGGAG CTGCTACAGA ATTATTCAAT TATGTTATTG ATGTAGTTAA AGCACGTGGT 840 

GCAGAAGTTA TTTTAACGGA CACCTTTGCG TTAAACAAAC CTGCACAAGG TTTATTTGCC 900

AAATTTGGAT TTCATAAGG 919

(2) INFORMATION FOR SEQ ID NO: 561:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 516 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 561:
ATCATATAAA CCGGCTGCTT CTAAAAACGA AAATACTGTT ACTGGACCTA AAAATTTAAA 60 

CCCGTATTGT TTTAAATCTT TAGATAGTTG TGTTGCAGTA TCATCAACTG TGATACGATC 120

AGAAGCATGT TCATACTGCA AATCTTTAGG CTTACCATTT ACATATGACC ATAAAAATTT 180 
ACTAAAACTA CCATATGCTT GTTCAATTTT TAAATACCCT TGAGCTTGAT TAACAATTGC 240 

TTCTAATTTT TTACGATGAT GAACGATATT TGGAAAAGTC ATTAAGCGGT CGATATCTTG 300

AGCGGTCATT TGTGCTACCT TTTCTGGTTC GAAATCATAA AATGCTTCTT CATAGGCTTC 360 
TTTCTTTTTT AAAATAGTTA ACCAAGATAG CCCAGCATGT TGTGaTTCTA ATGCTAAAAG 420 


TTTAAACAAT GCCTTGCnAT CATAGAGCGG TTGTCCCCAT ACATGGATCm TGATAGTCTA 480 
AGTAGACTGG GATCTTTAGT ACCAAATGCG CATTCATT 518 
(2) INFORMATION FOR SEQ ID NO: 562:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1539 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 562:

CTTATTTAAA AGTAAATCAA TCAATGTATT ATAATCCGAA TAGTCCGCAT AAAGCTGGTT 60 

TGCaGGCAAA tCAATTACTA CAACAAGCAA AAACCCAAAT TAATGCAATG rTTaATTCAA 120

AAACAAATTA TGATGTTGTA TTCACTAGTG GTGCAcTGAA TCCAATAATC TTGCTTTAAA 180 

AGGTATTGCC TATCGTAAAT TTGATACAGC GAAGGAAATA ATTACATCCG TGTTAGAGCA 240 

TCCGTCCGTA TTAGAGGTTG TAAGATATTT GGAAGCACAC GAAGGATTTA AAGTTAAATA 300 
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CAAAGTCGGT 


TTAGTAACAT GTATGTATGT AAATAATGTA ACTGGACAAA TACAGCCTAT 420

TCCACAAATG GCTAAAGTTA TAAAAAATTA TCCTAAGGCA CATTTTCATG TAGATGCGGT 480

TCAAGCATTC GGCAAAATTT CAATOGATCT CAATAACATA GATAGTATTA GTTTAAGTGG 540

ACACAAGTTT AATGGTTTAA AAGGACAAGG CGTCTTACTT GTAAATCACA TTCAAAATGT 600

TGAACCAACT GTCCATGGTG GTGGTCAAGA ATATGGTGTT AGAAGTGGAA CAGTTAATTT 660

GCCAAATGAT ATTGCAATGG TTAAAGCGAT GAAGATAGCT AATGAAAACT TTGAAGCATT 720

GAATGCATTT GTTACTQAGT TAAATAATGA CGTCCGTCAA TITITAAATA AATATCATGG 780

AGTTTATATT AATTCTTCAA CTTCAGGTTC ACCATTCGTT TTAAATATTA GTTTTCCTGG 840

CGTAAAAGGT GAAGTATTAG TTAATGCTTT TTCAAAATAT GACATTATGA TATCTACGAC 900

AAGTGCTTGT TCATCTAAAC GTAATAAATT AAATGAAGTA TTGGCTGCAA TGGGATTATC 960

AGACAAATCT ATTGAAGGTA GTATAAGATT ATCATTTGGG GCTACTACAA CTAAAGAAGA 1020

TATAGCGAGG TTTAAAGAAA TATTTATCAT CATTTATGAG GAAATTAAGG AGTTGCTAAA 1080

ATAATGAAGT ATGATCACTT GCTTGTTAGA TACGGGGAGT TAACATTAAA GGGTTCAAAT 1140

AGAAAGAAAT TTGTAAATCA ATTAAGAAAT AATGTAAATA AGTCaTTAAA AGGACTTGAT 1200

GGGTTTGTCG TTAAAGGCaA ACGAGATCGT ATGTATATTG AACTTGAAGA CCATGCaGAT 1260

ATAAATGAAA TAACATATCG ATTATCAAAA ATTTTCGGTA TTAAATCTAT TAGTCCAGTA 1320

TTAAAAGTAG AAAAAACAAT AGAGGCAATA AGTGCAGCGG CAATTAAATT gCGCAGaATT 1380

TGAAGaAAAC AGCACATTTA AAATTGATGT GAAGCGTGCC CGATTAAAAT TTCCCCAATG 1440

GATACGGTAT GGAATTACAG CGTGGAATTG GGGTGGTGCC AGTATTGGAG CACTTCGCCA 1500

TATTTCCAGT GGATGTCCAA CGTCCCAGnC CCAGGAATT 1539



(2) INFORMATION FOR SEQ ID NO: 563: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 968 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 563:
ATAnCGTATA CATGTGTTCT TTAAAATTGT GATAAGGAGT TTAGGATGGT TTATTTAAAA 60 


TCAATAGATG CCATTGGATT TAAGTCTTTT GCAGATCAAA CCAATGTTCA ATTCGATAAA 120 
GGTGTAACTG CAATTGTTGG TCCAAATGGA AGCGGTAAAA GTAATATTAC AGATGCTATT 180 
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ATCTTCTCAG GTGCAGAACA TCGCAAAGCT CAAAATTATG CTGAAGTACA GTTAAGATTA 300 

GATAATCATT CTAAAAAGCT CAGTGTTGAT GAAAACGAAG TTATTGTAAC AAGAAGATTG 360 

TATCGAAGTG GTGAAAGTGA GTACTACATA AATAATGACC GTGCAAGATT AAAAGATATT 420 

GCCGATTTAT TTTTAGATTC TGGATTGGGA AAAGAAGCGT ATAGCATTaT CTCGCAAGGT 480 

AGAGTTGATG AAATACTAAA TGCTAAACCA ATTGATAGAC GTCAAATTAT TGAAGAATCG 540

GCTGGTGTAC TTAAATATAA AAAAOGTAAG GCTGAATCAT TAAATAAACT TGACCAAACA 600 

GAAGATAATT TAACGAGAGT AGAAGACATT TTATATGATT TGGAAGGTCG CGTAGAACCT 660 

CTAAAAGAGG AGGCAGCTAT AGCTAAAGAA TATAAGACAC TTTCACATCA AATGAAACAT 720 

AGTGACATTG TAGTTACAGT sCACGATATT GATCAATATA CAAATGACAA TAGACAATTA 780 

GATCAACGTT TAAATGATTT ACAAGGCCAA CAAGCAAATA AAGAAGCTGA CAAGCAACGT 840 

TTAAGCCAAC AAATTCAACA ATATAAAGGT AAACGTCATC AACTTGATAA TGATGTTGAA 900

TCgCTTAATT ATCAATTAGT AAAAGCTACG GAAGCCTTTG AAAAATATAC GGGACAATTA 960 

AATGTTTT 968 

(2) INFORMATION FOR SEQ ID NO: 564:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 436 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 564:

TTGTGTGTAT GCATTCAATG TGCTCTGTTT GTAAATGGCT AGCTATATAA TTTAgGATTC 60 

GAGGATCGTC ATCGACAACA AGACATTGCA CCATAGCTAT AAACTCCCTT ATCTTTTTCA 120 

TTTATTATAC ATGTAAAATA TTTTTGCGTA AAAAAACAAT TGTTCATATT GAGTTCATAT 180

TTCAACCTTA TACTGACGCT AAAGAAGAAA TAGGGAGAAG TGAATCGATA TGAAATTAGC 240 

GATAAAAGAG ATTATGTTTT ACAAATTTCG TTATATTTTA ATCACATTAA TCATTCTTTT 300 

ATTAAGTATT ATGGTGTTAT TTATTAGTGG TTTAGCTCAn GGGCTTGGTA GGGaGAATAT 360

TTCGTwATTT GAACACTTTG GATAATGATG aaTATGtTGT TCaAAAAATG AAAGAGCCGC 420 

aAATTGaGAA ATCGCA 436 


(2) INFORMATION FOR SEQ ID NO: 565: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2554 base pairs
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 565: 

ATGTTTTGTC AATATCAATT GTTTGAGATA AATCCGCTTG TATAAAACGA ACTTTATCAT 60 

CATTAAATTT GCTAGTTAAT TCATTTATAT CAGTACGATA ATATTGCACA TAAACTTCAA 120

ATCCATCAGT TAATAATTGT TTGACTATCT CAGAACCAAT TGAACCAGAA CCACCTAATA 180 

CTAATGCTTT CATTACTTTT TAATCTCCAA ACGACTATCG ACTTGCTGAT CTAAATTTAA 240 

ATATAACGAT GACGTTTCGT TAATACTATC TAACGTGATA TTTTCAACAA TGTCTAACAT 300

GTCAAACACG CTAACACCTT CAAAATACAA TTTAGTATAT TGATTAGCAA TATATTCAGG 360 

TGAGTTTAAA CTTGATATGA ATTCACCTAT AAATTGCTTT TTCAAAAGTT CAAATGCTTC 420 

TGCATCTTGG AAAtTGCCTT TTTTATCACG CAACTCATCT AATAATAATT TTTTTAATTT 480

ATCTGGTTCT TCAGTAGCAC TTGTCACGAT TGAAAAACTA TACGTCGGCT CTAGTACAAA 540 

TTGATAACCA AATGTATCAT CGATAAGTCC TTCGTTTAAT AAATTCTGAT AAAAATCTGT 600 


TTCTTCCCCA AAAATTAACT CAAAGAATAA TGACATTTCT AAATCACGTT GTACATATTT 660 

TTGAGGCGCT TCTTGTAATG GTTTATTTTT AAAACCAAGC ATTAGTCTTG GTGATTGAAT 720 

TTTCATAGAT TCAGTAACAA ATGCTTCTTT AACATCcTCC GGTTCATCAA CAAGTCCTCG 780

TTCGATTTTG GGTTGGTTAA CTTTATTACG AGCATCCTCG TGTTGTTTTA CTATTCgACA 840

TATTGCTTCA GGATCCACAT CGCCAACAAC AAATAAAACC ATATTTGATG GATGATAAAA 900

CGTTTCATAA CATAGATACA AATCATCTTT TGTAATATCG TATATACTTT CTACACTACC 960

GGCAATATCA ACACGTATTG GATGTTGTTG ATACATTGCA CGCAAtGTAT TaAACATTAA 1020 

TTTATATCCA GGTTGCTCTT GaTACATTTT TATTTCTTCT GCAATAATAC CTTTTTCTTT 1080 

ATCAACAGTT TCTTTTGTAA AATAAGGCGT TTCaACCATT GTAAGTAAAC GTTTAATGTT 1140

GTTTTCAATA TtATCAGTTG CACTGAACAA GTAGCTTGTA CGATCAAAGC TTGtAAACGC 1200 

ATTTGCTTGT GCGTTATCTT CAGCAAACGC AGTAAATAAG CTTCTTCTTC TTTTTCAAAT 1260 


AATTTATGTT CcTAAAAAGT GAGCAACTCC ATCAGGTACA GTAACAAATT GGTCTTGTCC 1320 

AAGGGGTTTG AATTGATTAT CTAATGAACC AAATTGTGTA GTGTAAGTGA CAAATGTCTT 1380 

TTGAAAACCT GGlTkGGGGA TAATAAATAA TCGTAAACCA TTTTCTAATt CTTGTTCGAA 1440

tACTCTTTCG TCTATTAATT CATAATAACG CTCTTTCATT ATTTATCCCC TCCTTTTGTC 1500

AACACATAGA TtGTATCTAA AAATGCTTTT TCAGCAACAG AAACAATATC TTCGCGACTT 1560
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5 



TTATGCATAA TCTCTATAAT ACTTTTCGGA CGATCTTCAG ATTCATATCG ATGAGAAATG 1680

ATTACTTTTT TAGCTAACTC TAATTTTTCT TCAGTGAAAT CTCCTGCTTT TATTTTTTCA 1740

AATTCACTTA TAATAGTGTC TTTTGCAGTT TCGTACTTAT CACTTGAAAC CCCACTCAAA 1800

ACAAATAAAT AGCCATTTTT GCCATCAATT TGTGAATGTA TAGAGTACGC TAAACTTTGC 1860

TTTTCTCGCA CTTCATTAAA TAAAACAOAT GAAGGATCTC CTCCAAACAT CATGTTAAAT 1920

ACAACAAAGG CAGCATATCC ACTTTGTCCA TATTGTGTTG GAAAACGGTA TCCCATATTT 1980

AATTTAGCTT GATCCACGTC ATCATATTCA ACAATATAAT CAACTTCTTC ATCGTGTAAA 2040

TGATGAGTAG AATG7TGGAA TTGATGTTTA TCGAATGGTT TAAGTGCAAA TTTTTCACGT 2100

ATTTGTTTCT CAACACTTTC AGGTTCTACA TTGCCGACAA CATAAACAGA ACATTGATCA 2160

TTATTAATCA TTGATTGATA TGTATGATAT AGTGTTTCAG CAGTAATATG TGGGATTTGT 2220

TCTAGTTGTC CTGTAGATAA GTATTTATAT GCTTCATTTT CAAACATATG GTCGAGTAAT 2280

TTTAAAAACG AATATTGTGC TTTATTATCT ACCATTGCTT ctattttttt GGCTAATAAT 2340

GTTTTCTCTT GGTTAACAAA ATTATCATTG AATGCTTTAT TTTCAATTAA TGGATTCCAA 2400

ATGATTTCTT GtAATAAATC TAATCCTTGa TTAAATAATG AwTCACCGkT TCyTAAATAA 2460

CGkkCaTTAA caATTyCTAA tGaAAATGtA ATgACaTGCT GaTCTTTGAA TTTTGAAATT 2520

GTACTATTCA CATACGCACC ATATAAATCG GCTA 2554



(2) INFORMATION FOR SEQ ID NO: 566: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1424 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 566:
TnTCGCTATT TTnAATTGGT TTTGTATGGT TTAAGTTATA TCAATATACA ACAnACCCTA 60 
AAGCTGATAT CCCAGGTATC ATTTTTAGTA CGATTGGTTT TGGTGCTTTG TTATATGGTT 120 
TCTCAGAAGC TGGCAACAAA GGTTGGGGTT CAGTAGAGAT AGAAACAATG TTTGCGATTG 180 
GTATTATCTT TATTATTCTA TTCGTTATTA GAGAATTAAG AATGAAATCA CCAATGTTGA 240 
ATTTAGAAGT ATTGAAATTC CCaACATTTA CATTAACAAC AATTATTAAT ATGGTTGTAA 300
TGTTAAGTTT ATATGGTGGT ATGATTTTAT TACCGATTTA TTTACAAAAT TTACGCGGAT 360 
TcTCAGCATT AGATTCCGGA TTGTTATTAT TACCTGGTTC TCTAATTATG GGTCTACTAG 420 

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10 



TTGCTQTAAT GACTTATGCA ACATGGGAAT TAACTAAATT AAATATGGAT ACACCATATA 540 

TGACAATCAT GGGTATCTAT GTACTTCGTT CATTTGGTAT GGCATTTATA ATGATGCCAA 600 

TGGTAACTGC AGCTATTAAT GCGTTACCGG OACGACTTGC CTCTCATGGT AATGCTTTCT 660 

TAAATACGAT GCGTCAATTA GCAGGCTCTA TAGGTACAGC AATCTTAGTT ACTGTAATGA 720 

CAACACAAAC TACACAACAC TTATCAGCTT TTGGGGAAGA GTTAGATAAA ACGAATCCTG 780 

TTGTACAAGA TCATATGCGT GAATTAGCAT CACAATATGG CGGACAAOAA GGCGCAATGA 840 

AAGTGTTACT ACAATTTGTA AATAAACTAG CAACGGTTGA AGGTATTAAT GATGCATTTA 900 

TAGTTGCAAC GATATTTAGC ATCATCGCCT TAATTTTATG TTTATTTTTA CAAAGTAATA 960

AAAAAGCAAA AGCTACAGCT CAAAAGTTAG ATGCAGATAA TAGTATCAAT CATGAATAAA 1020 

TAAAATAAAT TAATTGAAGT GTGACTAATC AAAAATTATG TTGTGGGGAC ATGATTTTTA 1080 

AAGTATCGGT GCCAAATATG GTTATCGATA CTTTTTTTAT TTGTTGATTT ATAGAATGTT 1140 

AGAGGAATTA TATTAAAATT TGGCATTGAC GTAGTAGGTC ATTAATAAAG AAAAAGCAGG 1200 

AAGTGGGTCA ACGAAATGAA TTTTGTGAAA ATAACATTTC TGTCCCAATC CCTACTATAT 1260 

AACATTATTT TAAACGAGGC ATGCGATTAC GGAAGAATAA GCTTATAACA AGTAAACCGA 1320

TGGTACAGCC AAGTAAAATG ATGCCGTTAT GAATAGCGTC ACTTgCTGTA ATCACTTGAT 1380 

CTGGTGGTAC ATTTAAATAA TATTTTTTGA AAACATCTGC AATT 1424 
(2) INFORMATION FOR SEQ ID NO: 567: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 676 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 567:
TCTCCCATGT TCTGCTAAAT GACGCATCAC TTTTACTTCA TGAGGCGTCA ATACACGTCC 60 
TTCACCAGCA TTCAAACCGA CAACATTTAA AGGCCCATAT TCAATACGAG ACAGTTTCGT 120 


CACTTGATGA CCAAAATGTT CGAACATTCT TCTGACTTGG CGATTACGAC CTTCTGTAAT 180 
TGTAATTTCA ACCAATGTTG TGTTTTTATC TTTATCTTGT TTCTTAACTT TCACTTCAGC 240
CGGTTGCGTC ATACCATCTT CTAATTCAAT ACCTTTTTCT AGCGCTTTCA CTTCTTCTCT 300

CATTAAATAA CCTTTTAATT TCGCAACATA TTTTTTCTTA ATTTGATATC TTGGATGTGT 360

CATTAAATTA GTAAATTCAC CATCATTTGT GAGTAATAAC AATCCAGAAG TATCATAGTC 420 
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ACGTCCTCTA TCATCAGATA CACTTGTkAT CACTTGAGTT GGkTTATGGA AkAAAATGkA 
AAtTTTGTCT TCTAGTTCTA TTTTAATACC TTCAACTTCA ATCGTATCTG ATGGCTTCAC 
TTTTGTTCCT AATTCAGTQA CAGTCGTACC ATTCACTTTC ACTTTTCCTT CAGAAATTAA 
AOTTTCTGCC TTACGT 

(2) INFORMATION FOR SEQ ID NO: 568: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 568: 
GAAACGGTTC TACCAAAAAA CAGTAAGGGC TAAACCCAAT CATGGTAAGA CAAAAAGTAC 
AAATAGCCAT GCCCAAGTTG AACTCGCTGT ACGCCTATTT CTTTCTAAAA AGATAATAAT 
AAAAGCCAAT ACTAAATTAA TGATGAATCC AATGGCTAAA ATAATAGTAA ATAACGTTCC 
TAAATCGTTT GAAAATGTAA ATCGCATAGT CTTTTCTCCT ATAAAGAAAG GCACAAAAAA 
ACATTTTGCA CCTTTCACGT CATATTATTT ATTCACAGAT AAAGTTAAAA TTGcATTGAA 
TTCTTCTTCA TTATTTGGGA ATGTTCTTTC TTCTATTTCT TTAATAGTAA TATTTACTAA 
TTTTAAATTT GTAGCTTCTT CAGAACTTAA AAAAGCATTA ATGTTTTTTT CTAATAACTC 
kAAAGTCTCA GCTGtAAAaG TTTTAAGTTT AATT 
(2) INFORMATION FOR SEQ ID NO: 569: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 569:
ACGATATTAC CACCCTCTGA AATAACATCA ATCATGTGAT CTGTAAAGTC CCAAGGATGA 
CTTGTTGTGA AACGAACTCT TGGAATCGCT ATTTTAGAAA TTGCTTGTAA AAGATCTCCT 
AAGTCATATT CTATATCCTG TAAATCTTTA CCATAAGAAT TTACATTTTG ACCTAAAAGC 
GTTATTTCTT TGTAACCTTC ACGAGCAAGT TCACGTACTT CATCTATAAT GTCTTCAGGT
CTACGGCTTC GTTCTTTACC TCTTGTAAAT GGAACAATAC AATATGTACA AAACTTATCA 
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TCAATAACGT CTCCTTCTTT AGACCATACT TCAACAACCA TTGCTTTAGA TAAGTATGCT 
TCTTCTAAAA TTTCTGGTAA ATGATGAATA TTATGTGTAC CAAATATCAT ATCTACATTT 
TGATACGATT TTAAAATTTT ATTCACTACT QACTCTTCTT GTGACATACA ACCACAAACA 
CCGATTAAAA TATCAGGTCG TTCTTTTTTC AAATrCTTCA AATTACCTAT TTCACTAAAC 
ACTTTGTTCT CGGCATTTTC TCTAATCGCA CATGTATTAA TTAAAATAAC ATCTGCAGTG 
TTAATATCAG TCGTXGCTTG aTAGCCTAAT GCyTCmAGTA TACCAGCAAT GACCTCAGTG 
TCATGTGCAT TCATTTGACA TCCATATGTT TTAATTAAAA ATGTACGCTC GTTCCCCATA 
CCGOGATATT TTyCATCAAT TtGGgAAATC nCTATTATAA CGAACTTCTk GTtTACCnCC 
TTTTTTnCGC TCCTTTAAAA TTAAGGCGGC TOATAAACAG GTCCAAAATA TTAC 
(2) INFORMATION FOR SEQ ID NO: 570: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 570:
TATCAATCCC ACAGCACATG CTGAACAAGA TCAAACATGG GAGAAGATTA AAGAACGCGG 
TGAACTTAGA GTGGGTCTTT CTGCAGATTA TGCACCAATG GAATTTGAGC ATACAGTTAA 
TGGTAAGACT GAGTATGCAG GTGTAGATAT TGATTTAGCT AAAAAAATTG CGAAAGATAA 
TAATTTAAAA TTAAAAATCG TCAATATGTC ATTTGATAGT TTGTTAGGAG CTCTTAAAAC 
TGGAAAAATT GATATTATTA TTTCCGGAAT GACTTCAACG CCTGAACGTA AGAAGCAAGT 
TGATTTTTCA GATTCATATA TGATGACTAA AAATATCATG CTTGTAAAGA AAGATAAAGT 
TAATGAATAT AAAGATATCm AAGACTTTAA TAATAAAanA GTnGGGGCAC AAAGGGACTG 
AACCAGAAAA AATCGCTCAA C 
(2) INFORMATION FOR SEQ ID NO: 571: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1205 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 571:






EP0 786 519 A2 





AAGTCGTTCA TCTTTTTCTA CAAGGGTGTA AAAATAATCA ATCATATCGT ATAACGCTTC 120

TTTACCAATG ATGTCATATG GTGTTGTTOT CATTTAATCA CCCATTTTCA AAAATTTACT 180

OTTACGAACT TAAGTTAATA TATAACTAAT ATAAGATGAT TTTAAACATT TGAAAGAAAT 240

ATGCATATTT GCCAATTTAA TTTATATTGT TTGAAAGTGT TTCTTTTTTC TTGAAAAAAC 300

GTTGAACTTT ATTTAAAGGT tGATGATGTT CGAGGTTTAO TTCGTTtAAT AAAGATtGGA 360

ACTTTTGTAA ACCTTGATTA TAGTCTTTAA CTTCGAACTC TAACTCATAA TCCGTAGTAT 420

CGAAATACTC ACTTTTATCT AAAACCAGTA AATCACCTTT ATATTTAGTX XCTTGGCGAT 480




ATGTCGTTAA 


TGCACCAAGT 


ATTGATAAAG 


TTGTATCTTT 


TACACCAAAC 

• +*^m m*^^ WW1V 


TGTTCAACTA 

X \rf X X ^^M^Vv X*» 


540 




TAATTTGACG 


AATGTCATCT 


GGAAGATTGT 


CGTTTGAAAT 


AATCAAGTTC 


ATCTCTGGTT 

** X X >» X W>7X X 


600 




TAATGTCGAC 


GATATAGTTG 


x a x^»x«mxn 




X Vlw X wu X \^ X V» 


TTTAAAGTCA 

x. x ifwtfiu x 


660 




TTTPATATTfS 

& X X Va^% Xf% X X 


ATTfSTCTTTA 




GTAGTGCAGA 

1AU X uVJIVWl 


GCt^TGTTPC 
WvUnxux x 


x x x^vtx x x un 


1 X> KJ 




AATCGGGTGT 

fin x wwvjw x * 


ATCAATATAG 


X «W» X X W\Vv X X 


t "fl'&TTiA A Afv 


V»^lX«X%Vr X V* X Vv X 


TTAAAATATT 

X X Ortftn X~n X X 


780 






X X X X IriXnv) 


ATTY*:Ai-nr*An 


X inlUni X lu 


X L. InAn 


ATt* TT 1 A*IY2 AT 
A 1 LI X Utn X 




TTflTTn ("V™ 1 ATP 


A T ATflTA TA 
vrllnX Inln 




Oil ATTf* A AT* 


^PTPRTy A H O 
XJniVvi InnL 


TATATTATYSA 






ATGACAAAGT TGAATTTTAA AAGTAATTTC CTTTATCTAT TATCAATGTT AATTTGACCA 960

TTAAAAATAG TGTTCGTAAG TGTTTTGTAT TATTGaATTG TGTTAAAATG TTATGGAATA 1020

AGAGGAGGAT TAAGCATG3G TTTwTATATT AATGAAATTA AAATTAAAGA TGACATACTT 1080

TATTGTTATA CAGAAGATTC TATTAAAGGA TTATCTGAAG TAGGACAAAT GCTCGTTGAT 1140

AGTGATAATT ATGCCTTTGC GTATACATTA GATGATGGTA AAGCGTATGC TTATCTCATT 1200

TTCGT 1205
(2) INFORMATION FOR SEQ ID NO: 572:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 570 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 572:
TGAAGAAATA GCAATGATGA AATGCCGTAT GTTATATGAG ACGGGGTCAT TTCTTGAATT 60
AAGAGAAGAA ACAATTGTCT TATTGAAAAC TGGCATACAA CAATATGATG CATTGATGAT 120 
TTATTACGTG AAAAGTTTGA TTGGTTTGGG ACAATATTTT GAAGCGGTAG AAGTAATTCA 180 
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10 



ATTTGCTAAG TCAAAATTAA TTGAAOATGA AAAACGATTG ACTCAGTCAT TAGCTGATTT 300 

TGrTACGTTA TCAATGAGGG AACAGACGCA CTTGATTTTG AAGTTAATAG ACAATGGTCA 360 

TTTTCAATTT CAAGAAACGG TATTATATAT ATkAAaAyCT AATaCGTACa GTtATAACCT 420 

CATTAGTTTA ATGATTGAGT ATTTAAGGTT CGCAAATTGT ACACAAGAAC TGACAATTGA 480

AAAGTATGGT ATGGATGTAA CTTTTGTACC AGCTAATTTA AAAGGGCTAG AACATACAAC 540 

ACTTAAAGAA AAAGTTATAC CTAACGTTAT 570 

(2) INFORMATION FOR SEQ ID NO: 573: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 939 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 573:
GTTGAATGGT TAGCAGCTGC AGTTGTATTA TATTTCTGTG GTGTAATTGT TGACGCTCAT 60 
GTATCATTCA TGTCCTTTAT TGCAATATTT ATCATTGCTG CATTATCAGG TTTAGTCAGC 120 
TTTATPCCTG GTGGTTTCGG CGCTTTCGAT TTAGTTGTAT TACTAGGATT TAAAACTTTA 180 
GGTGTCCCTG AGGAAAAAGT ATTATTAATG CTACTTCTAT ATCGTTTTGC GTACTATTTT 240 
GTACCGGTAA TTATTGCATT AATTTTATCA TCATTTGAAT TTGGTACATC AGCTAAGAAG 300 
TACATTGAGG GATCTAAATA CTTTATTCCT GCTAAAGATG TTACGTCATT TTTAATGTCT 360 
TATCAAAAGG ATATTATTGC TAAAATTCCA TCATTATCAT TAGCAATTTT AGTATTCTTT 420

ACAAGTATGA TCTTTTTTGT AAATAACTTA ACGATTGTkT ACGATGCTTT tATATGATGG 480 
AAATCACTTA ACGTATTATA TTCTATtGGC AATTCATACT AGTGCTTGTT TATTACTTTT 540 


ACTGAATGTA GTTGGTATTT ATAAGCAAAG TAGACGTGCC ATTATCTTTG CTATGATTTC 600 
AATTTTATTA ATCACAGTGG CGACATTCTT CACTTACGCT TCATATATTT TAATAACATG 660 
GTTAGCTATT ATTTTTGTTC TGCTTATTGT AGCTTTCCGT AGAGCGAATA GGTTGAAACG 720 


CCCAGTAAGA ATGAGAAATA TAGTTGCAAT GCTTTTATTC AGTTTATTTA TTTTATATGT 780 
TAACCATATA TTTaTTGCTG GAACGTTATA TGCATTAGAT ATTTATACGA TTGAAATGCA 840 
TACATCTGTA TTGCGCTATT ACTTCTGGCT TACGATTTTA ATCATCGCTA TCATCATAGG 900

TATGATTGCA TGGTTGTTTG ATTATCAATT TAGCAAAGT 939 
(2) INFORMATION FOR SEQ ID NO: 574: 

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(A) LENGTH: 1059 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 574: 



GAATTAATTA AATATTACAC ACAGCCTCAT TTTTCATTTT CAAATAAATG GCTATATCAA 60

TATGATAATG GAAACATTTA TGTTGAACTT AnGAGATATT CATGGTCAGC ACATATATCT 120

TTATGGGGCG CTGAAaGTyG GGGAAATATT AATCAGTTAA AAGATCGTTA CGTAGATGTG 180

TTTGGACTAA AAGACAAAGA TACTGATCAG TTATGGTGGT CTTATAGAGA GACATTTACA 240

GGTGGCGTTA CACCAGCCGC AAAACCTTCT GATAAAACTT ATAATCTTTT TGTGCAATAC 300

AAAGATAAAC TACAAACGAT TATTGGTGCG CATAAAATAT ACCAAGGCAA TAAACCAGTA 360

TTAACATTGA AAGAAATCGA TTTCCGTGCA CGAGAAGCGT TAATAAAAAA TAAAATATTA 420

TATAACGAAA ATCGTAATAA AGGTAAGCTT AAGATCACCG GTGGCGGTAA TAACTACACT 480

ATTGATTTAA GCAAAAGATT ACATTCAGAT CTAGCAAATG TTTATGTTAA AAATCCTAAT 540

AAAATAACTG TTGACGTCCT CTTTGATTAG TATATGAAGG TGACTTATAC TTCATGCACT 600

TTAATTCCAA ATCAGATTAT TTAAATGATA ATTTTTAAAG TGTATGATGT ATATAATAGG 660

TAAAATTTTC TATATATTTA AATGGAATTG GGAGTAGGAA TGTGACAGAA ATAGTATTTT 720

ATAAAATTTA TTCtTGTCAC TCCCCAACTT GCACATTATT GTAAGCTGAC TTTCCGCCAG 780

CTTCTATGTT GGGGCCCCGC CAACTTGCAT TGTCTGTAGA aTTTCTTTTT GAAATTCTCT 840

ATGTTGGGGC CCCGCCTATA ATTGAAAAAT GCTTGTTACA TGGGCATTTT CATTCGGTCA 900

ACTACTACCA ATATAATATT GtAGaGCCTA AGACATTGAT TTATTATGTC TTAGGCTCTA 960

TTCCTTCATT TAATGATTAA nTTATTATAG CAATACTTTA TTGTCCCATG ATTAGTGTTC 1020

TTTTAATGAG ACATAGTAAC TATAAAGTTT AATAATCGT 1059



(2) INFORMATION FOR SEQ ID NO: 575: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 574 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 575:
GTTTGCTTTA GGTCCTGTTT CATATTTATA CTTCGAAGGA TTTACCTTTT TGaAGTCTGG 60

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TTCAACTXGC TTTTTATTCT TTTCGAAATC AGCTGGTTGA GTAGTTATGA GTTCATTATT 180 

TTTATTAGmA TAAATCTTAC CATTAACATA TTTATAATCT TTTGTTATAA AGTCACCATT 240

TCTGAATGGA ACTACTTGAT TATGACCTTT AGAGAATAAA TCAGTACCGA ACATTAAATA 300 

GTTCTTOGTA TCTATACCAG CCAAATGTAA AATTGTTGGC ATTACATCGA CTTGACCAGC 360 

ATATTCATTA TTGATACCAC CAGATTTACC AGGGATTTTA ATCCAGAAAC CAGTTCTGTT 420 

TAAATCTGTA AATTTAGCCG GTGTGATTTT TTCACCTAAT AGTTTTTCCA TGGCATTGTT 480

ATGGTTTTCA GAGATACCAT AGTGGTCACC ATAAATCATA ATCACTGAAT TGTCATATAA 540

TCCTTTTTTC TTCAAGTCAT TAATATATTC TTCT 574
(2) INFORMATION FOR SEQ ID NO: 576: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 796 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 576:

CAATGTTTTA TAGTACAATA TATTTTnAAT AATACTCGTT AAGGAGAATG ATATGATATC 60 

AATTCACGCA ATTTCGACAG GAAAAATCCA AGATTTGCCT TATAGCTCGA AAAGACCGAT 120

GCGCTCTGCT TTAGATAAAA CTAAGATTTC ACAAACAACA TGGTTATCTT CAACTGGTTT 180 

CACTGGTGAT GAACAGGCTT ATAAAGATCA TGGTGGACCA CATAAAGCAG TTTGTGGGTT 240 

TAGTAAGCAT AATTATGCAC TGTATCAAGA TGATTTACCT ACACTACCTA CTCATGCGAT 300

GTTTGGAGAG AATTTAACAT TTGATTATTT AGACGAATCT GATGTTTACT TTGGTAATCA 360 

ATATCGTTTA GGTGAAGCGT TAATTGAGGT TTCTGAAATT AGAGAACCAT ACTGGAAAAT 420 

TCAAGCAAAA TATAATATTC CTGATTTAGT GAAGCGCATG TCTACATCTG GTAAAACAGG 480 

TTTCTATTTC CGGGTATTAA AACAAGGCTA TGTATCTCCA AATGATCAGC TTTACTTAAT 540 

ACAAGAAGCA CCAATCGAAC ATCGTTTATC TGTACAACAG CTTAATGACC TTTATTATAA 600 

TGATAGACAA AATCAAGaTA TGTTACGATA TGCACTAAAC AATCCATTTC TGTCACCAAC 660 

AAGACGCGAT AAACTTCAAA AAATGTATAA CAGAACATTG GAAATAATTA CCTTTCATTn 720 

ATAAGTGTTA AATGAACTTT TCAAAACAnA AAGGAATCAA CTTCACACAT CGTTTGTATG 780

AATAGTCTTA TCTATA 796
(2) INFORMATION FOR SEQ ID NO: 577: 

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45 
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(A) LENGTH: 1095 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 577:



AGAAATTATG ACAAAATATA AAGATGGAAA GTTAGTTTAT GCATCAGTCG AACCAGGATC 60

TTACGTAATA CATAAAGATG ATGCAATTAA ATATGACGAT TATTCTAAGT TAAAAAAATT 120

AAGTCAGCTA ACTAAACTTG ATCATCCAAA ACCAGTTCCA TATAGCGTaC TCAAATCAAA 180

TCTTTCGGAG TACCTTTAAC AAGCGTTTCA TTTATGACAC ATGGATCAAA GGATACTAAA 240

GATGAAGTGT TGCCGGCATT GGCCTATTTC ACTTTTTCAC CAAAAAATTA TGAAGACAAG 300

TCTAATCCAG ATCCAAAAGT TTTAAATTTA GTACATATGG ATTTCTTAAA TGCATCTAGT 360

GATTTTGGTA ACGCACATTT TGTTGTTTTA AGTAAATATA TTAAAGAGTA TGAATCAAAC 420

TATGAAACAG CGTCAGATGA TTCTTTAAAA TAGTATTTAC TGTGTGAAAA ATAAATAGTG 480

TACTACATTA AATAATCGCA ATAATAATCC CGATAAACAA TCAGCATTAC TGCTTATCAC 540

ATAGAGTTCG TAATAACTAT AACTCTATGA TTCGCAAATA ATAAATGATT GTCATCGGGA 600

TTTATTTTTA TCAATTTATA AAGTGACATT ACCTTGTTCA TCAGCAGGTT TGAAAACAGT 660

AATCACTGCA CTAATAATTG CTAAAATGTG TGGGATACCT GTCCAACAGA ATATTAAGTG 720

TAGAATACCT TGCATATTCT TGCCGGCATA AAATTTATGA ATACCAAAAC TACCTAAGAA 780

CAATGCTAAT AAAATATAAA TAACTTTGTT TACTTGCATT TCTTTCCCTC CAGTTGAATT 840

GCTTATAATG ACATTAGCTT CTCTTTTTAT TATACCCACT TTTAGTTCAA ACATTCTAGT 900

TTAAGCATTC CCAATCATCT AAATTTCAGT TATTCAATCC TTACAATAAA TTTAGGATTA 960

CATTTCAGTT GCATTGTATT ATTTTACGTG TGAAATATAC GTAATGAATC ACATGACAAy 1020

CTyCAAATTG AAAAATATAC ATTCTATGAT GTAAGGTCGC ATTTTTAATA TATTTACGTn 1080

AAAATAGTTT GGATG 1095


(2) INFORMATION FOR SEQ ID NO: 578: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 489 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 578: 

ACTGCCAAAA TAATCATAGC CATGTTCTAC AGCTGCTTTC GCTACAATAT CCAAACGCAT 120 

TTCAAAACAA GCGGTACAAC GTAAGCCGCC TTCTTTTTCA TCAGCTAATT CTTTATCCTT 180 

CACCATTTTC ACAAACTTAT GTGGTTCATA AGGTGCTTCA ATATACTTCA CATTCGCACC 240 

AGTCTTGCGA TTAAAATCTT CCACAAATTG TTCTTGCACT TTAGCACGTC GTAAGTACTC 300 

ATTTTTCGGA TGAATATTTG AATTCGCGAA ATAAATTGCA ATGTCTGCAT ATTGTGTTAA 360 

AAACTCTAAT GTATATGTAC TACAAGGTGC ACAACAACTA TGCAATAAGa TTTTAGGTCT 420 

GATTGCTTCT CTTTCCCACT GsCCGATTAA TnTCTTCAAC ACCTTGTCCA TAATTAATTT 480 

GTnGATTTn 489 

(2) INFORMATION FOR SEQ ID NO: 579: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1287 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 579: 



CGTACTCCAT CATTTGTTGA AAAATTTGCA AACTTCTTCA AGATGTTATG GCTTAGACTT 60 

AAAGCGATGA AGCACTACAA AGCCTTAAAT AAAGAATCTA AGAAGCAAGA ATTTGAAAAT 120 

TCATTCAAAG ATGTTCAAAA AATTATGCGT ATTGTGAATC ACAATATTAT TTTACGCTTA 180 

AAAGAAGAAC AAAATAGTAC AAATGTACTT GAGGTTAGCT TAGTCATTAA TCATTACTAT 240 

GATATGAGTC GCTCATTAAA GTGGCGTGCA CAACGTCGAA AAGAACGTCA AGAAAACAGC 300 

AATCAAATCA TACCGCAAGC TATGTTCCAT AACCACAAAT TGGAAGCATT GTACTTACAA 360 

CGTCATCTTT TAGATGAATT AATTCGCAAA AATAAAATCA ACAATATCGT TGCAGCTCAA 420 

ATTCGAGAAA ATATCAATTa CAACGAAATT GTCTTGTCTT TACAGTCCAA ACATTaAGCA 480 

AGaCwTaCaw TmCCCCCGTA CATATGAGAC AAAGTCATTA TCATCTCATA TGTACGGGGT 540 

TTTTATATTC AACATCAAAA AATCAGATTG ATGAAAAGTA AATAACCTTT CATCAATCCG 600 

ATTTGATTAT AGAATCTATT TTTTAAGTTT AAATGGAATT GTACATACGT TAACATTCTT 660 

TTGATAAATT AAATACAATT TCATACGCAA ACTAGTTTGA TTGTGTAATA AATTATGCCA 720 

ACGTTTCTTA GTAATAAATT CTGGTATCAC TACTGTAATC ATATAGTTTT GATCGTTGGC 780 

TTTACGATTA ATCTTATCGA TAAAACGTGA AATTGGTCGT ATAATACTGC GATATTCCGA 840 

ATGTAAAATA ACTAATCTTA CATCTGGGAA ATGACGTTTC CATTTCTCTT GGAATGCTTT 900 



TGCGTAATAA ATAGACTTAT CAACTGCTGT TGTAATACTT GTTATCGGCA CAATTGCTAA 1020 

ATTACGATCT ACCACGTCCA CATTAAGAAC ATCAATGTCA GAACGTAATT GTTCTGCGAT 1080 

ATCTCGATAA TGkTTGkTAA TTTTCAAGAA GAAAATCACC ACGAACGGCA AGAAAATAAG 1140 

TATCGGCCAT ACTTGGCTAA ATTTAGTTAT GAGTAAAATC cATAAnAACA ATAAATGTCA 1200 

CGATACCACC AAGTAAGTTC ACAGACAACT TACTTAACCA ATTCTTAGGA CGTTCATGAA 1260 

TCCATTTAAT AACCATACCG AATTGTG 1287 
(2) INFORMATION FOR SEQ ID NO: 580: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1223 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 580: 



GCCTCCTTTC CTATATCCAT GGGTGTCCTA GTTGGGAATG GCTTATTCCC TATCCCAAAC 60 

ATCTGTCAAT TCAAGACTAT CACTGTATAC TAAATCGCCT nTCTATACnT CCATnCATTT 120 

CATTTTCCAG TTGGAACAAT TTGTTTAATT ATTTTAAATA TCATTATCAA TCCAGAAGTA 180 

TTTACTATTC ACTTTTACAA TAATCAATCA TTTAACTACA CATGGGTAGT CGGTGGTTTG 240 

CTTGGGGTTA GCTTTTTAAC TGGCAATTTA TTGTTATTGC CAAAATTAGG TGCAACATTA 300 

ACTGTAATTG CAACAGTTGC GGGTCAAATT ATTATGGGTG TCATTATTGA TACATTTGGA 360 

TTATTTGGCG CTACAATTCA TGATTTTAAT TTAATTAAAG CAATTGGAGT ATTGTTACTC 420 

ATTGTCGGCA TCGTCATAAT GAATCAATTT AACAAGAATA ATTTATTACT AACTGATCAA 480 

AAGTATTTAC TGTTTTGGCT TCTATTAGGA TTTATTTTTG GTTTCTTTCC ACCTATTCAA 540 

ACGACAATTA ATAGTGCTTT AGCTAGTCAT ACTCATTCAC CAGCCTTTGC ATCATTAGTA 600 

TCATTTACAA TTGGGTCAAT AGCGCTATTG ATTTTAACCG CTATTTTTAA TCGTTCTTTA 660 

AAACTAAAAA CAAGTCATTT AAAATTCGGT AAATTAAAGC CTATCTATTT TACTGGCGGT 720 

ATACTTGGCA TGGCTTTTGT AACAGCTAAC ATTATCTTAA TGCCTCATAT GGGTGCAGCA 780 

TTAACAACAC TTATTGGGAT GTTTGGCCAG ATTCTAATGG GCATATTGAT AGATCACTTT 840 

GGATTATTTG GTTCACCTAA AATAGCAATG ACATCCAGAA AAACTATTGG TCTATTATGT 900 

ATTTTGACAG GCATTATACT TTTAAGATTA TTTTAAATTA ACTTTTAGCT TATCATTTTA 960 

ACTTGTAATT ATTTTTAAAA GTGATAAGCT ATTTTTTTGT GGTCTAAAAA TCTTTAGAAA 1020 
CAACTCATTC TTAAGACCTA AATTAATGTT ATnTTTTAAT AATTTACACC AAATTAATAG 1140 

CAAAAATTAT GTTATTCGTG CTAATATTTC ATAGTTGGTT ATTCAATTAA TTAAAAATAA 1200 

GTCAAAATGC ACAACTTTTT ATn 1223 

(2) INFORMATION FOR SEQ ID NO: 581: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 454 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 581: 
CACCTTTTTA TCATGCTTAG TTATCAATAA ATCTATATTT TGCTGTTTTA CAATTTTTTT 60 
AACTTTATCA ATCTCATTAT CTTGGACTAA ATAAATATAT GATCTTGCAT CTGTTGCTAG 120 
AGCTTGTTCG TGTTTTTCTG ATAAAACATA TGTGATGGAA GCGTGAATAA TAATGCCTAA 180 

TGTAACAAAA CTGATAATTA ATATACTGCT TATCAATAAC ATTAAGCGGT GGTGAAACTT 240 

CATCATTGTT CTTTAGGTCT TTCCAATTTA TAGCCTAAGC CACGCACAGT TTTAATAAGT 300 

TGTGGCTTCT TAGGATTATC TTCTAATTTA TCTCTTAAAT GACTGATATG TACATCAACA 360 

ATTCTTGAGT CTCCTGCAAA TTCATAATTC CATACCGTAT TTAACATATG CTCTCTCGTA 420 

ATGACTCTGC CTTGTCTTTC TATCAAATAA AGCA 454 

(2) INFORMATION FOR SEQ ID NO: 582: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 452 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 582: 



TTTATAAGAT TTTATTTACA AATAATTGGT TTTCATATGT ATAAACACTT TTGACTTTCA 60 

AATCTTAACG ATAATTCTAT TACAATACAA TCCCCTATTA GAATGATTTA TGTAATAAAA 120 

AAAGCGGAGT TTCCCCCAGC TTTTCTAAAC GACTACATAA AATATAAGAT TGCAATTAAA 180 

TGCAATAGTG ATGCTATTAC AATAAAAATA TGCCAAATCA TATGAAAATA TGGTCTATTC 240 

TTTTGTGCAT AAAACCATGC ACCAATTGTA TAAGACACAC CACCTAAGAA AATGAATAAT 300 

ATGAATATCC ATGATGTGCG AATAAAAATA ATTGGTAACA AGATAATACC TACCCAGCCC 360 
AAAATCCCCC AAAGTGTCGT TCCCCATAAT AA 452 
(2) INFORMATION FOR SEQ ID NO: 583: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1472 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 583: 

CGCTTTTTGC AATAAGTCAT TAGCCGCTTT TAAGCCTTCT TCTTTTCGAT CTACAACAAG 60 

TAAAATAAAT GGCTTTAACG C'TTCV JC AT r TTCACTTTCA AGCATATCTG GTTTTTGAAC 120 

CATTTCAAAT GGAGATTTCA ATCCATTATT ATCGCTCATT TCAATAATTG CATCATACTG 180 

TGCTTGTGAC ATACTTGCAA TAGCCTGTTT TGCATTTTCT TGAAGGAAAT ATAAGTTTTT 240 

CAATTTAGGA TGCTTATTTA ATGTACTTAA TGTAATCGGT GTAATGTCTT TCTCATAAGA 300 

CACTTCAATC ACTGTACTAT TTGTTCTACC AGGAATTGGT GGTTTTTCAT GAATATGCTT 360 

TGATACTTCT CCAATTCCAA CGACAGATTG ATTTTTCGTT CGATTATAAA AAATAATATT 420 

GTCGCCTTCT TCTAACTGAG TATAAAAATG ATAACCATTA CGTTTAATAC CGTTGTACGT 480 

GTGCGTATAA ATCGTATATT GQTTTCCAGG TTCAAATTCT TCAGTTTCAG CTAAAAAGAA 540 

ATAACGCGGT ATCTTAATTT CGCCTTTACC AAGACCACTT ATTAAATCAA ACTCTTCTGC 600 

AGTGATTTGA TTGAACAATG TCTCTTTCAT ATnACTTATA CGAAATTCCA AAGCTTCACT 660 

ACGCTTTAAA TAATCTGCTG TTAATGGTTT CAATTGTTCA TTAAAACGAA ACTGTACACG 720 

TATTTTATTT TGTGCACCTG TTTCAACACT AATAATTTCA CCACATCCAA GTAGTCCAGT 780 

ATCCGTCTGA ACTTGATAAA AGATGACTTG ATCTCCTACT TTAGCCTTTT TAAACGCTCT 840 

AAATCCTTGA GATGGGTTAA AATGTGCGCC TGATTCAAAT AAAGCTGTTT GTCCTACTAA 900 

CGGTTCATTA TGATTCCAAC GGTTATATCC ACAATTCAAC CAAAAATAAT TCGTTTCTGC 960 

TGTCATCTTA ATACTCCTTA ACCTGAATAA ATTTTAGAAA CACTATGAAT TACATTCTTT 1020 

tAGTGTTTCT TATGCAGTTG GACGCGTATG CGAACAACTG TATACCCTTT GTTCACTGCG 1080 

ATTTTAATCG CATTTCCTAT AACATTGTAG CGCCCAGGAC ATTAATTTAC GTCCCAGACC 1140 

CTTATCGTTT TCACTTCTAA GTAAGTCGAA CTATTTTGCT TTACAACAAG TGCGACTCTA 1200 

AATACAGTTG GACACACATA CGAGCAACTG TATACCTTTT AATCAGTTTT CTATATTTTA 1260 

TTTATTATAT CTGTCTTAAT GATAAAAATT GTTACAAACA GTTTAACATA TTTAGCTACC 1320 

ATGATACCAC TATGCTTGCn TATCTCTATA GCGCCATTGA TACACATTTT TAAATATCTA 1440 

TACTGCCGTT AOAATTTTAT CATGTCTnAA TT 1472 
(2) INFORMATION FOR SEQ ID NO: 584: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 584: 

GGTAGTGAAT GGGGTTCAAG ACAACAGTAT TGGAAGTACG AATGAATCAC AGTTTTTAGG 60 

AAATTATATT GTAATTAAGC ACGCAGAAAA TGAGTATAGC TTAATAGCTC ATTTACACCA 120 

ATATTCAATC ATTGTGAATG AGGGGCAAAA TGTTAAATAT GGTGATATCA TTGGGAAGGT 180 

TGGGAATTCT GGCAATTCTA CTGAACCTCA TATACATTTT CAAGTAATGA ATGATAAGAA 240 

TATTGAAGCA TGTACATCTT TAAAAATTCG ATTTATAAAT AATCGAGAAC TTATCAAAGG 300 

GGATGTGGTC TGCGGATTAC AAGCTGAATG ATGGCGATAC TTATAAAATC TCGACACTAT 360 

AAAAATGGTA TAGTGTCGAG ATTTTCTTGC TTATTTAGTT AATTCAAAGT GCACGCCGGA 420 

TTCATTAGAA GTCGACGTAT TTTTGTTTGT AATAGAGTAA CCGGTCATTG AAATTTTAGA 480 

TTCAATATCT GAAGCGGAAT TTGTAGATTC AGGATTATAG AAGCTACATT CATAAGTGTT 540 

ATCATCTTTC TTTTTAAGTA TAAACATACC TTTGGCTTTA ACTTCGACTT TAGTGTTGTT 600 

AATGTCAAAA GTTTGAGTGC TATTACTATA ATTAACACCA GCCCAAACCG ATTCATTATC 660 

TTTCACAACG GGGAAGTCAT CTTCTTGCTT AACGACGTnA CTTCATCTTT CTCTGTCTTA 720 

AAGACATCTT TAGATAAGCC TGGATACAAC ACATATCCAT ATTTATTGTC AGAATTAGAA 780 

TGCTTTT 787 



(2) INFORMATION FOR SEQ ID NO: 585: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 830 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 585: 
ATGCTAAATT AGGGGGAATA TTTACGATAA AGAGACCAGA AAAATAATAA AACACATGCT 60 




TAAAATGAAG 


ACTATTTTTT 


ATTACAAGAA 


AATGTATCTA 


GTAAACTTAA 


AGTAGCAAGA 




CCTAATAAAT 


TTAATGCATG 


TTGTGCACCT 


Tm™TACCTT 


GGCCAGCTTC 


GAAATGTTTG 


245 


TAAGCAGCTA 


CACTTAAAAT 


GCCTATCGTT 


GATAGTGATG 


CAAGGGGAGA 


AATGTTTTTA 

M A A A A A A M\ 


j v v 


TTQATAAAGC 


TAGCTGAGTA 


TAAAGCAGCA 


GTAGTTGCTT 


CTGCAATGCC 


RACGTATTTT 

W A4~V A AAA 


•J O V 


ACAAGTTCTT 


TTTGCAAGCC 


AAAAGTATGT 


TCAAACAGTT 

A winnwiw x a 


CAATCAT AC C 


^ A AAA V» A A 




TGCAATTTAG 


GTTTACTGGC 

Vp* AAA r% W A V7\9 W 


a a vj\j a t\ x n\jc 




MA X 1 llVVit 1 


1 l l/v\ 


A Q f\ 


CGCAAAATCA 


AAA A AXVVa J. 




1 X\3J\ 111111 


u lunAAun X\3 


nl InUl 1 1AI 


34 U 


CATTTTTACC CGTTTCTATA AAAATGAATC AATTATGTAA CGTATGTGTA GTTTAGGAAT 600 

GTTTGCTATG GAAATATAAT TCTGTTCACT CAAAATGTAT GAAATTAATG TGTAGTTTTG 660 

TCGAGTTGCT CTTTTAATTT GGTTAGATTG "l"l"l~l"l"i'AGAG AAGCGGTACT ATTTTTAAGT 720 

GCATCAACAG ATTTACCTTC GTTTTGAGAC ATTGAGTTTA TTACAGCACG AAGTTCTGTT 780 

TCTAGTATGT CaGCGTCGCt TTAGCATTAG AACTTAaTAt TTAtAcTCTT 830 



(2) INFORMATION FOR SEQ ID NO: 586: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 586: 

TTAGGACGTT TTTACAATCA GTACAATGAG CTCATTGTTA TTAGTCCTTT AACGGCGTCT 60 

TTTAATGCTG GCGCTACATT TGGGCGATTT CATCATTTAA TTGATACTGA AACTTTAGCA 120 

AAATTAGAAC ATGAAAAAGG ACATTATTAT CAGAAGATGA TATGTGATGA CAATGTAGAA 180 

ATGATTTCTA TAAATAACAT ACCGAAATAT CCGAGAAATC ATAATGTATT AACTAATCAT 240 

GACTCATACG AATATTCATT GAATTTAGGA AGTAGTAATA GTTATTCAAA GTATGAGCTT 300 

ACCTTAGATG ATATTTATGT TGGTGCTACC TTTtAACAAA TTATATTTAT ATTCTAGCCm 360 

ACtAAATAAA AGGGkaCtaT TTGaATCaAA CmATaTGtAT TAACCTTTTT TA 412 
(2) INFORMATION FOR SEQ ID NO: 587: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 587: 

TTCAGTTTGA AATTAATCAT ATAAATTTCT TATGGGAGGG TTGATATCTT AATGATTAAC 60 

ATTATTTCAG CTATA6GATC TATTGGAACA TTTATTATGG CTTTATTTTA TTTTGTATCA 120 

GTTTCAGTTC AACTTTATCA AATGAAAATT AGCTTTCTGC CAGCTTTAGG TTTTAACCAA 180 

ATTTTATTAG AAAGGGAGGA GGATCAACTT AATATAATGA ATTCGGCAAC AGAAGAGCAT 240 

CATCATAAAG ATTATATTAA ACTATATAAT TTAGGTGGCG GTGCTGCTAA rAAAATTGCA 300 

ATAGAGGTTT TATTGGGGAA GGATArAGTC ATTCAGAAAA AATACGTGCA TATTkTACCT 360 

AGTAAAGAAG GGTACATGTT ACCAATTAAT AAAAATGTGT ACGAAGAATT AGAAAGAACG 420 

ATTGAflAACA ATGGTCATGA AGCTGATTTG AATGTACGTA TGACTTATTA TCATAATGTA 480 

AGTCGCAAAC AACAGGAAGT TATATTAAAA GGTCAAATCG ACCGTTTTAA TACTTATAAT 540 

AATAAAGAAA TTTATGATTT GCAGTTTATC TAAAAATTGA TTTAAGAGGG TAGTTGTTTA 600 

TTGCGAAAAA TATCATTCAA TTTTAATGAA ATAATGGCGT CATTACTATA AAATATTACT 660 

TTATGTTGTA ATGCATTTTT CTATAAGATA GAACTAAAAG GAGGGGCAAA GATGCAAATT 720 

AGACAAATAC ATCAACATGA CTTTGCTCAA GTGGACCAGT TAATTAGAAC GGCATTTGAA 780 

AATAGTGAAC ATGGTTATGG TAATGAATCA GAGCTAGTAG ACCAAATTCG TCTAAGTGAT 840 

ACGTATGACA ATACCTTAGA ATTAGTAGCT GTTCTTCAAA ATGAAGTTGT AGGGCACGGT 900 

TTACTAAGTG AAGTTTATCT TGATAACGAG GCACAACGGG AAATTGGATT AGTGTTAGCA 960 

CCTGTATCTG TTGATATTCA TCATCAAAAT AAAGGTATTG GGAAGCGATT GATTCAAGCA 1020 

TTAGAACGAG AAGCAATATT AAAAGGATAT AATTTTATCA GTGTATTAGG ATGGCCGACG 1080 

TATTATGCCA ATCTAGGATA TCAACGCGCA AGTATGTACG ACATTTATCC ACCATATGAT 1140 

GGTATACCAG ACGAAGCGTT TTTAATTAAA GAATTAAAAG TGAACAGTTT AGCGGGAAAA 1200 

ACAGGTACCA TAAATTACAC ATCTGCTTTT GAAAAAATAT GATTTCAAGC TAGGATTACA 1260 

TTAGGTAGAG TTCATATTAA TAATAAAAAA TGTTTGCAAT CAAATCGTAC GTTGTCGTTT 1320 

GTAATTCTTA AAATAGCAAT AAATAAAATG TTTGTTAGTA AAGTATTATT GTGGATAATA 1380 

AAATATCGAT ACAAATTAAT TGCTATAATG CAaTTTTAGT GTATAATTCC ATTGACAGAG 1440 

ATTAAATATA TCTTTAAAGG GTATATAGTT AATATAAAAT GACTTTTTAA AAAGAGGGAA 1500 

TAAAATGAAT ATGAAGAAAA AAGAAAAACA CGCAATTCGG AAAAAATCGA TTGGCGTGGC 1560 

TTCAGTGCTT GTAGGTACGT TAATCGGTTT TGGACTACTC AGCAGTAAAG AAGCAGATGC 1620 

AAGTGAAAAT AGTGTTACGC AATCTGATAG CGCAAGTAAC GAAAGCAAAA GTAATGATTC 1680 
GTCAAACACT AATAATGGCG AAACGAGTGT GGCGCAAAAT CCAGCACAAC AGGAAACGAC 1800 

ACAATCATCA TCAACAAATG CAACTACGGA AGAAACGCCG GTAACTGGTG AAGCTACTAC 1860 

TACGACAACG AATCAAGCTA ATACACCGGC AACAACTCAA TCAAGCAATA CAAATGCGGA 1920 

GGAATTAGTG AATCAAACAA GTAATGAAAC GACTTCTAAT GATACTAATA CAGTATCATC 1980 

TGTAAATTCA CCTCAAAATT CTACAAATGC GGAAAATGTT TCAACAACGC AAGATACTTC 2040 

AACTGAAGCA ACACCTTCAA ACAATGAATC AGCTCCACAG AGTACAGATG CAAGTAATAA 2100 

AGATGTAGTT AATCAAGCGG TTAATACAAG TGCGCCTAGA ATGAGAGCAT TTAGTTTAGC 2160 

GGCAOTAGCT GCAGATGCAC CGGTAGCTGG CACAGATATT ACGAATCAGT TGACGAATGT 2220 

GACAGTTGGT ATTGACTCTG GTACGACTGT GTATCCGCAC CAAGCAGGTT ATGTCAAACT 2280 

GAATTATGGT TTTTCAGTGC CTAATTCTGC TGTTAAAGGT GACACATTCA AAATAACTGT 2340 

ACCTAAAGAA TTAAACTTAA ATGGTGTAAC TTCAACTGCT AAAGTGCCAC CAATTATGGC 2400 

TGGAGATCAA GTATTGGCAA ATGGTGTAAT CGATAGTGAT GGTAATGTTA TTTATACATT 2460 

TACAGACTAT GTAAATACTA AAGATGATGT AAAAGCAACT TTGACCATGC CCGCTTATAT 2520 

TGACCCTGAA AATGTTAAAA AGACAGGTAA TGTGACATTG GCTACTGGCA TAGGTAGTAC 2580 

AACAGCAAAC AAAACAGTAT TAGTAGATTA TGAAAAATAT GGTAAGTTTT ATAACTTATC 2640 

TATTAAAGGT ACAATTGACC AAATCGATAA AACAAATAAT ACGTATCGTC AGACAATTTA 2700 

TGTCAATCCA AGTGGAGATA ACGTTATTGC GCOGGTTTTA ACAGGTAATT TAAAACCAAA 2760 

TACGGATAGT AATGCATTAA TAGATCAGCA AAATACAAGT ATTAAAGTAT ATAAAGTAGA 2820 

TAATGCAGCT GATTTATCTG AAAGTTACTT TGTGAATCCA GAAAACTTTG AGGATGTCAC 2880 

TAATAGTGTG AATATTACAT TCCCAAATCC AAATCAATAT AAAGTAGAGT TTAATACGCC 2940 

TGATGATCAA ATTACAACAC CGTATATAGT AGTTGTTAAT GGTCATATTG ATCCGAATAG 3000 

CAAAGGTGAT TTAGCTTTAC GTTCAACTTT ATATGGGTAT AACTCGAATA TAATTTGGCG 3060 

CTCTATGTCA TGGGACAACG AAGTAGCATT TAATAACGGA TCAGGTTCTG GTGACGGTAT 3120 

CGATAAACCA GTTGTTCCTG AACAACCTGA TGAGCCTGGT GAAATTGAAC CAATTCCAGA 3180 

GGATTCAGAT TCTGACCCAG GTTCAGATTC TGGCAGCGAT TCTAATTCAG ATAGCGGTTC 3240 

AGATTCGGGT AGTGATTCTA CATCAGATAG TGGTTCAGAT TCAGCGAGTG ATTCAGATTC 3300 

AGCAAGTGAT TCAGACTCAG CGAGTGATTC AGATTCAGCA AGCGATTCCG ACTCAGCGAG 3360 

CGATTCCGAC TCAGACAATG ACTCGGATTC AGATAGCOAT TCTGACTCAG ACAGTGACTC 3420 

AGATTCCGAC AGTGACTCAG ATTCAGATAG CGATTCTGAC TCAGACAGTG ACTCGGATTC 3480 
CGATTCTOAC TCCGACAGTG ATTCCGACTC AGACAGCGAT TCAGATTCCG ACAGTGATTC 3600 

CGACTCAGAT AGCGATTCCG ACTCAGATAG CGACTCAGAT TCAGACAGCG ATTCAGATTC 3660 

AGACAGCGAT TCAGATTCAG ATAGCGATTC AGATTCCGAC AGTGACTCAG ATTCCGACAG 3720 

TGACTCGGAT TCAGATAGCG ATTCAGATTC CGACAGTGAC TCAGATTCCG ACAGTGACTC 3780 

AGACTCAGAC AGTGATTCGG ATTCAGCGAG TGATTCGGAT TCAGATAGTG ATTCCGACTC 3840 

CGACAGTGAC TCGGATTCAG ATAGCGACTC AGACTCGGAT AGCGACTCGG ATTCAGATAG 3900 

CGATTCGGAC TCAGATAGCG ATTCAGAATC AGACAGCGAT TCAGATTCAG ACAGCGACTC 3960 

AGACAGTGAC TCAGATTCAG ATAGTGACTC GGATTCAGCG AGTGATTCAG ACTCAGGTAG 4020 

TGACTCCGAT TCATCAAGTG ATTCCGACTC AGAAAGTGAT TCAAATAGCG ATTCCGAGTC 4080 

AGTTTCTAAC AATAATGTAG TTCCGCCTAA TTCACCTAAA AATGGTACTA ATGCTTCTAA 4140 

TAAAAATGAG GCTAAAGATA GTAAAGAACC ATTACCAGAT ACAGGTTCTG AAGATGAAGC 4200 

AAATACGTCA CTAATTTGGG GATTATTAGC ATCAATAGGT TCATTACTAC TTTTCAGAAG 4260 

AAAAAAAGAA AATAAAGATA AGAAATAAGT AATAATGATA TTAAATTAAT CATATGATTC 4320 

ATGAAgnAAc rCCTTAAAAG GTGGCTTTTT TACTTGGATT TTCCAAATAT ATTGTTTGAA 4380 

TATAATTAAT AATTAATTCA TCAACAGTTA ATTATTTTAA AAAGGTAGAT GTTATATAAT 4440 

TTGGCTTGGC GAAAAAATAG GGTGTAAGGT AGGTTGTTAA TTAGGGAAAA TTAAGGAGAA 4500 

AATACAGTTG AAAAATAAAT TGCTAGTTTT ATCATTGGGA GCATTATGTG TATCACAAAT 4560 

TTGGGAAAGT AATCGTGCGA GTGCAGTGGT TTCTGGGGAG AAGAATCCAT ATGTATCTAG 4620 

TCGTTGAAAC TGACTAATAA TAAAAATAAA TCTAGAACAG TAGAAGAGTA TAAGAAAAGA 4680 

TTGGATGATT TCAATATGGT CCnTTCCCA 4709 
(2) INFORMATION FOR SEQ ID NO: 588: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1554 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 588: 

CTTTTTTAAt TAwCgGaAtA TTGtCaTgaT tAcAcTTCGt TAGGGTTTAc gTCGtAATTT 60 

CATTTaaTAA gCgCTTCAcC ATTAAAtGTG gTAmCCTTTA ATTCGCCAGT AGAAACATCA 120 

CAGTAACTAA GCGCAATTTC AGGTTGATTC ATAACAAAAC TTAAAATATA GTTATTTTGT 180 
CGTCTAACCA TACCTTTCGT TTGTTTCGGA TCTTCCATCT GTTCACAAAT AGCTACTTTA 300 

TATCCATTAT TAACAAGTGT ATCTATATAA CTATCTGCAG AATGATACGG AACACCACAC 360 

ATCGGAATTG GATTTTCTTT TTTAGCATCT CTTTTAGTTA AAGTAATTTC AAGTAyACGT 420 

GATGCCTCCT TGGCATCTTC ATAAAACATT TGATAGAAAT CACCTAGTCT AAAAAATAAT 480 

AAGCAATCTT GGTATTCTGA TTTTATTTTT AAATATTGCT GCATCATTGG TGTAACATTA 540 

GACATATTAT TTCTTCACAA CCCTTGTCTC TTTTTAAAAT TTGTCTTTAC AATATATTCG 600 

TTTGTAAGyT TTTTAATTAT TAATTATTTA ACTTATACAT TTTAACATAC TTACTTTTAC 660 

AAACCTATTC ATACCATATA ATCACGAAGC ATCTTAAATG TATAAGAAAA CGCCTCAAAC 720 

CTAATAAAAT GTGTCAATAG CATGTTTAGA ATTAAATTAA AATTCTAACA TTCAAGACAT 780 

TTAATTAAGT AAGGGCGTTC AATATTAAAA TGAACAATGA CTCTGTTTGA AATCATATAT 840 

CATAAAATTA TTTTATAAAC CTTTGAAGAA TACCACGTTT TTTTAGAGTA ATTAATAAGA 900 

AATAACTTAT AATAGATCCG ATAGCACTTG aGACTATGaA CGTAATCATT AACGGTTTAA 960 

TGAAGAAGTC TTGAAGCCCA AGGaAATATG CTAATGGtAT aCaAATTAAA cTTCCgATGA 1020 

CaCCAGTTCC aAGTACTTCa CCGACCGCGG CCaTAAATAT ATGTTTACGA TATnygTAAA 1080 

ACATACTAGC CAATAAAACT CCAATCATAC TACCCGGAAA TGCAAAAGst GTACCAGTAC 1140 

CAAAAAGAAC TCTTAAAATT GATGATATAA GCGCTTGAGC TAATCCATAC CAAGGACCTA 1200 

CTATGACCGC ACTTAATACA TTTACAAAAT GCTGTACTGG TGCTGCCTTA ACTGGTCCTA 1260 

GAGGAATGAT GATAATACTG CTTAATACAA CATTTATTGC AATTAAAAGT GCAGTTATAG 1320 

CCAGTTTTCT TGATTTCATA TGATTGTTCT CCTTTTTGTT TGTAATTAAT CACTATGCTT 1380 

GGCTTTATTA TGGTCATTTA AACGTGTTTC CATTGTTGAT ACAAACATTT TGAATAATTG 1440 

ATTCGCTTCA TATTGTGAAG TTTGAAACTG TTCAACTATG GGCAATGTAT TTATTTCTGC 1500 

TTCTAtACTC TGAATGGtAT GTTCCGACTG nTCCAGCGCA TTTGTTTCCC GnAA 1554 
(2) INFORMATION FOR SEQ ID NO: 589: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 638 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 589: 
TTTTGACGAA AAAAGTTGAT TTACAAATAT ATAAACGTTG TGATTTCAAT GTTTGTATAG 60 
TATATACATO ACAGCAACTT GGGAAAAAAA GGAAOOTAAC GAAGOTTTAT TAACTGTTAC 180 

TGTTOCTGCA GAAAAAOTAA ACAAAGCrCT GAGAGATCCC CTCATAATTT CCCCAAAGCG 240 

TAACCATGTG TGAATAAATT TTGAGCTAGT AGGGTTGCAG CCACGAGTAA GTCTTCCCTT 300 

GTTATTGTGT AGCCAGAATG CCGCAAAACT TCCATGCCTA AGCGAACTGT TGAGAGTACG 360 

TTTCGATTTC TGACTGTGTT AGCCTGGAAG TGCTTGTCCC AACCTTGTTT CTGAGCATGA 420 

ACGsCCGCAA GCCAACATGT TAGTTGAAGC ATCAGGGCGA TTAGCAGCAT GmTATCAAAA 480 

CGCTCTGAGC TGCTCGTTCG GCTATGGCGT AGGCcTAGTC CGTAGgCAGG ACTTTTCAAG 540 

TCTCGGAAGG yTTCTTCAAT CTGCATTCGC TTCGAATAGA TATTAACAAG TTGTTTGGGT 600 

GTTcGAATTk CAACArGTaA GTtAGtTGCT AGAnCCCA 638 
(2) INFORMATION FOR SEQ ID NO: 590: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1242 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 590: 

AAAATATTCC CGTACATTTT GATGTCTGTA GGGGCTTTTT TGACTTTAGG ATTTGTCATT 60 

TTTTCAATTC ATAAAGGGAG ACGAACGAAA AATGAATCAG CACGTAAAAG TAACATTTGA 120 

TTTTACTAAT TATAATTACG GCACATATGA CTTAGCAGTA CCAGCATATT TACCGATAAA 180 

AAACTTAATA GCTTTAGTAT TGGATAGTTT GGACATTTCA ATATTTGATG TCAATACACA 240 

AATTAAAGTG ATGACGAAAG GTCAATTACT TGTTGAAAAT GATCGACTCA TTGATTATCA 300 

AATCGCTGAT GGAGATATTT TGAAGTTACT ATAGGAGGGA AAATAGATGG TTAAAAATCA 360 

TAACCCTAAA AATGAAATGC aAGATATGTT AACGCCTTTA GATGCTGAAG AAGCAGCTAA 420 

AACAAAATTA CGCTTAGATA TGAGAGAGAT TCCTAAGTCT TCAATTAAAC CAGAACATTT 480 

TCATTTAATG TACTTATTAG AACAACATTC TCCATATTTT ATAGATGCTG AATTAACTGA 540 

ACTACGTGAC aGTTTCCaAA TACATtATGA CATTAATGAC AATCATACAC CTTTTGATAA 600 

TATTAAATCA TTTACTAAAA ATGAAAAATT ACGTTACTTA CTCAATATCA AAAATTTAGA 660 

AGAAgTAAAT CGTACACGCT ACACATTTGT GTTGGCACCA GATGAATTAT TTTTCACAAG 720 

AGATGGATTA CCCATTGCTA AAACAAGAGG GTTACAAAAT GTTGTTGATC CATTACCTGT 780 

GTCAGAAGCT GAATTTTTAA CAAGATATAA AGCGCTGGTT ATCTGTGCAT TCAATGAGAA 840 

AACTAAAGTT ATTGAAGCGG CAACGTTAGA TTTACTAACQ GCATTTTTAG ATGAACAGTA 960 

TCAGAAACAA GAACAAGATT ATAGTCAAAA TTATGCATAT GTACGCAAAG TAGGACATAC 1020 

CGTTTTCAAA TGGGTTGCTA TCGGTATGAC AACGTTAAGT GTTTTATTAA TTGCATTCTT 1080 

AGCCTTTTTA TATTTTTCAG TAATGAAGCA TAATGAGCGC ATTGAAAAAG GATACCAAGC 1140 

ATTTGTAAAG GATGtTATAC GCAAGTACTA AATACGTATG ATGATTTAGA TGGTaAAAAt 1200 

TgaTAAAGAG GCACTTTACA TTTATGCCAA AAGTTATATC CA 1242 
(2) INFORMATION FOR SEQ ID NO: 591: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 744 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 591: 
TTCCAGATAG AGCCTTAGTT GCTGCCGCTG AATTGTCTGA TCGTTACATC ACTGATCGTT 60 
TTTTTACCAG ATAAAGCGAT TGATTTAGTT GACCAAGCAT GTGCAACAAT TCGTACGGAA 120 

ATGGGATCAA ATCCAACTGA ATTGGATCAA GTTAATAGAC GTGTCATGCA ATTAGAAATT 180 
GAAGAAAGCG CACTTAAAAA TGAATCTGAC AATGCGAGCA AACAGAGATT ACAAGAACTA 240 

CAAGAAGAGC TTGCCAATGA AAAAGAGAAA CAAGCAGCAC TTCAATCTCG TGTAGAATCA 300 
GAAAAAGAAA AAATAGCAAA TTTACAAGAA AAACGTGCGC AACTAGATGA AAGTAGACAA 360 

GCGTTGGAAG ATGCACAAAC AAATAACAAT TTAGAAAAAG CTGCTGAACT ACAATATGGA 420 

ACAATTCCTC AATTGGAAAA AGAACTTAGA GAATTAGAGG ATAATTTCCA AGATGAGCAA 480 

GGTGAAGATA CAGATCGAAT GATTCGTGAA GTTGTAACAG ACGAAGAAAT TGGCGATATT 540 

GTCAGCCAAT GGACAGGCAT ACCAGTTTCA AAATTAGTTG AAACAGAACG TGAAAAATTA 600 

CTTCACTTAA GTGACATCTT GCATAAACGT GTTGTAGGTC AAGATAAAGC GGTTGACCTG 660 
GTTTCAGATG CAGTAGTTAG AGCAAGAGCA GGTATTAAAG TnCAAACAGA CCTATTGGTA 720 

GTTTCTnATT CCTAGGTCCn ACTG 744 

(2) INFORMATION FOR SEQ ID NO: 592: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1449 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 592: 

TTGTTATAGT GTCTGAATCT GTGTAACTCT TGCCCATGTG TTCTGAGTAA AGCACCCACT 60 

GTTTATTTAC TTTTCGTTGT AGTCTAGCTT CGTGTAGTAG TTTGTTTAAC TTTTGTGCTG 120 

ATATACCGTA GTCTGCCGCG ATTTGAGTTG TGGCTAATGT GCCAGTTGAT TTTAAGATTT 180 

CATCAACATA ATCTGCTTTG GGTTTTAGCT CTCCAATTTC TTGTTGTAAA AGTAAGTTTT 240 

GCTCTTTTTC TTTCTTATAC TCAGTCAACA CTGTAATGAT GTAGTCTGGA TCTTTTAATG 300 

TTTGTTCAAT TACATTGTCT GTTGCGTATA TACCGTGTTT GCGAATAGCT GGTaGGACAT 360 

CTGATGTTAC CCAGCGTTTG AATTTTCTAG CGGTTTCTCT AATTTTTTCG TTTTTGCTTT 420 

GTTTAGAAGC ATCGAAGATT AGACTGTATA ATCCTGATTC GTTGATAATG ATCATATTTC 480 

TGTTTTGACC TGATGCACTA AATTGGTGCG TCAGTTTGTC CTCGCTATCA ACATGATTTC 540 

TAATGGCATT GTCTGATCTT GCATATCCTA AAATCTCAGC AATATCTTTT CCTACAAAAT 600 

AAGGTTCGTT TTCAATTTCT ACTGTTCTTA CTGGTAGCTC TTTAAAATTA AATGTTTGTA 660 

ATQCTTGCAT TTGAGTATCC TCCTTTTTCC TCAACACCCA CATTCAGCAG ACGGTTATCG 720 

CAATGACTAT CGAATGTATT TAAACGCGGC TCATATCATC GCCAGcTCTC GCTCACATCT 780 

GCTCAATGTG GATGTTGATA AGCGTGGTTA TATTAAGAAG TGAATGTTAC TGATTCACTT 840 

TCCGCCACTC TGTTAAATCA GTAACTTTGT TATCGCTTTC AACACCGTTA AGCTTGTCTA 900 

ACGCTTTCAC TACTTTTTGG AACTCTTTGA TAGCACTtCG TAGCTTTTTA GTAATTTCAT 960 

CTTCTACCAT TTCCAAACCA GCAAATGGGT CTTCGTTATT CATGCTTAGA TGTTTGTTGA 1020 

AAAGATCTCG AGTGTATCTT ATTTCTTTAA GTGATTTATC ATAAGCTTCA ATTTGTCCTG 1080 

AAAGGTTATG ATATTTTAGT TGTAGTTTTA CTAATTTTAA TGATTGGTCT TGCATTTGTT 1140 

ATGTCTCCTT TAAGATGTTT GTTTGCGTTT CGTGTACTTT GTGGGTAAAA AAATATCTCC 1200 

AATATTTTCG TCAAAAAAAT CAGCGATAAT AAACATCTCA TCATTCTTAA ATTGATGCTT 1260 

TCCTAATTCT TTTAAACGAT AACCTTCAGT TGATATATTC AAGAGGTTTG CTAAATCTTC 1320 

TTGAGTACAC TTTCTTTCTT TTCTCAACTT TATTAAATTC CATTGCATGT TGTCACCTCC 1380 

CGCtTTACAA AACCTACTAT ACACGATACG GTACTTGnGT CAACATAAAA GTTTGCTTTT 1440 

CGTGTATTT 1449 
(2) INFORMATION FOR SEQ ID NO: 593: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1170 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 593: 

ACCAAAAACA CTTTCTAAAG AAATTGTTTC AGAACCATTT GCTAAAAACC AATTACGTGA 60 

AAAAGCTCGT CTAACTAACA TTAAAGOTTT AOAQATTCCA AGAGTTTCAT ACACTTTAGA 120 

CGATGATGAT TTCATTACAG ACGTAGAAAC AGCAAAAGAA TTAAAAGCAA AAGGTGATAC 180 

AGTCAAGTTC ACTACTAATA AATTCAAAOT ATTTGCTGCA ATTTCAGATA CTGTAATTCA 240 

TGGATCAGAT GTAGATTTAG TAAACTGGGT TGAAAACGCA CTACAATCAG GATTAGCAGC 300 

TAAAGAGCGT AAAGATGCCT TAGCAGTAAG TCCTAAATCT GGATTAGAAC ACATGTCATT 360 

TTATAATGGA TCTGTTAAAG AAGTTGArGG AGCAGACATG TATGATGCTA TTATTAACGC 420 

TTTAGCAGAT TTACATGAAG ATTATCGTGA TAACGCAACA ATTTATATGC GATATGCAGA 480 

TTATGTCAAA ATTATTAGTG TTCTTTCAAA TGGAACAACA AATTTCTTTG ACACACCAGC 540 

AGAAAAAGTA TTTGGCAAAC CAGTAGTATT TACAGATGCA GCAGTTAAAC CTATTGTGGG 600 

AGATTTCAAT TATTTTGOAA TTAACTATOA TGGAACAACT TATGACACTG ATAAAGATGT 660 

TAAAAAAGGC GAATATTTGT TTGTATTAAC AGCATGGTAT GATCAGCAAC GTACATTAGA 720 

CAGTGCATTC AGAATTGCAA AAGCAAAAGA AAATACAGGt CCATTACCCA GCTAAGCCCC 780 

AAAAGGTTAA TGTAACAGCT AAGGCTAAAT CAGCTGTAAT ATCAGCCGAA TAGGGGTGAT 840 

GAAATGAGTT TAGAAGAAAk TAAATTGTGG TTGAGAATTG ACTATAATTT CGAAAATGAT 900 

TTAATTGAAG GTCTCATTCA ATCGGCTAAG TCTGAATTAC TATTAAGTGG GGTnCCAGAT 960 

TATGACAAAG ATGACTTGGA ATACCCGCTT TTTTGTACAG CGATTAAATA TATCATTGCA 1020 

AGAGATTATG AAAGTCGTGG ATACTCAAAT GACCAATCTA GAAGCAAGGT GTTTAATGAA 1080 

AAAGGATTGC AAAAAATGAn TTTGAAATTA AAAAAGTGGT AGGTGATTTT TAAATGGAAT 1140 

TTAATGAATT TAAAGATCGC GCGTATnTTT 1170 

(2) INFORMATION FOR SEQ ID NO: 594: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1393 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 594: 
TGAAGGGTGT TTTGTTTATG GCCAATTGCT GTGTTATTAA AnCGAATGTT TCGAATGGAA 60 
ATCATTCCGG AAGAGGACAT GTCTTCAGCT TCTGATTTGA AAGCACAGTT TTATCGCGGT 180 

ACTTTGCAGC GCTTTTACCA AtCnTTGCAC GCAGAAAAGC TTACACCTTA TGTTATGTCT 240 

TATGACGATA TCATTTCATT TTGTAAAGAA AACAATATCT CTGAAOTAGT GACTGCGGGT 300 

GATATTATGA GTTATCATCT TGAAGAATAT GATATTTTAC ATCAACGTTC TTTATTCAAT 360 

GAAGCACGCA TTGCCGTTAC TTTGATACGT GGGAATCATT ACTTTAAAGC GAGTAAAACA 420 

ATGAATCAAC AAGGGGAGCC ATACAATGTT TTTACTAGTT TCTATAAAAA ATGGCGACCT 480 

TACTTGAGGC ATAGAGACGT ATATCACTAT GATTTAAAAT CATTCGAAAA CTTTGTCATT 540 

GCATCACCTG ATGATTTAGT GTTTGATGAC ATAGCATTTG GATCCTCACA aATAATTGAA 600 

CAGatAAATG GCAACATTTT TTAGATCAAG ATATACAGAA TTACGAAAGC GGAAGAGACT 660 

ATTTACCTGA AGTATTAACA AGTCAGCTAA GTGTTGCTTT AGCATATGGA TTATTAGATA 720 

TTATTGAAAT TTTTAATGAT TTATTGGCGC GTTATGATGA AGATGAGGCA AACTATGAAG 780 

CATTTATAOG TGAACTCATT TTTAGAGAAT TTTATTATGT GTTAATGACA CAGTATCCTG 840 

AAACCTCATA CCAAGCTTTC AAACCTAAAT ATCGACAGAT AAAATGGTCG CAAAATGAAG 900 

CGGATTTTAA TGCATGGTGC GAAGGGCAAA CAGGATTTCC AATCATTGAT GCAGCAATAA 960 

TGGAATTGAC ACAAACTGGT TTTATGCATA ATCGAATGAG AATGGTTGTG TCGCAATTTT 1020 

TAACCAAAGA TTTATTTATA GATTGGACAT GGGGAGAAAA ATTCTTTAGA AAGCACCTTA 1080 

TTGACTATGA TGCAGCATCA AATATTCATG GATGGCAATG GTCTGCTTCT ACAGGTACGG 1140 

ATGCAGTGCC GTATTTTAGA ATGTTTAATC CAATAAGACA GAGTGAACGC TTTGATGCTA 1200 

AAGCTTTGTA TATCAAAACA TATCTTCCGA TTTTTAATCA AATTGATGCA AAATATTTGC 1260 

ATGATACACA ACGCAATGAG TCCAACCTTT TTGAACAGGG GATTGAATTA GGTAGTCATT 1320 

ATCCAAGACA AATGGTAGAT CATCAAGAAA AACGTACACA AGTTTTAGCT ACATTTAAAG 1380 

CGCTAGACTA ATT 1393 
(2) INFORMATION FOR SEQ ID NO: 595: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1484 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 595: 
TTAACTTGAA AGATTTCCCC GACATCTATA ACTTGTATAA TGTGTATATG TCGTTTTTAG 60 



AAAAATTAAA AAAAGAATAT CTTGATTGGT ATAATCAGAC CTTAGAGTTT TCTAATTTAT 180 

CAAACAATGT AGTAAGwATA GATACTCCTT TTAAAGATAA TTCTTTAGAT AATTTAATAA 240 

TTTACGCTTT ATACGATCAG TCCAGAGACA TGATTACACT GACAGATGAC GGCTATACTA 300 

TATtTGATTT AGAAAATAAT GGTATTTCTT TAAATAAATC AAAAAAAOGT AAAAAGATTT 360 

TTGAAGAGCA CCTTTCAGCT TACGGTATTA AATATAACGA TAAAACTCAC GAAATTTTTG 420 

TTCAAACTAA CTTTAAAAAT TTTAATAAAT CGAAACATAA TTTATTACAG TGCCTTATAT 480 

TTGTTAATGA TATGTACTTA CTTTcTAATC CTAAGTCACA GAACATATTT ACAGAAGATG 540 

TTGCAAACAA ATTGGATGAA CATAACATTT ATTACGGAAG AGATTTACCT ATTATAGGAA 600 

GCAGTX3GTGT TGTTCATAAT TTCGACTTTT TTATTAGCGC TAAGAAAAAT PAAAAAf3AAA 660 

AATTTATCAA TGCTATTTCT AACCCTAATA ATTCTATGAT TATTAAGTCG AAAATAAHfVI 720 

ATGCTATGCA AGCAAAAAAA ATAAAAAGAC ACAGGCAAAA x vmu x x x ■rx x l x A L*\L X X 1 AA 780 

ATGACTCAAA AAAAGAAATA AATGAACATA ATAAAAATCT TCTTCATGAA AACTATATTA 840 

GTACAATAGA TTATAGCGAA TTAGATGAAA AGATAGGTTT ATTGATTTAA TATATATAGA 900 

CGTGATAATA TCAATGTTTA TATTAAATGA AACGAATTTG AAAATTTCGA AACAGCTTAA 960 

GATAGCAAAT TGAATAGCCT TATTGATAAT GCAGAATCGT CTACACTTAG TTGAACAAAT 1020 

TCTATGAGAA TAGATATTGT TAAACTATTT GGGTAGGCGA TTTTATTTTG ACAAGAGTCA 1080 

GAAGATTATT TAGTTCATAG TTTAAGTTAC AAATGGCTAA ACTTTATGAA AATGGTATGA 1140 

CTAGAATCGA AATTATACTC GAATACGATT TAACACTCTC AATCTTCTCG AACTGAATAA 1200 

ATACTGGGTC ATTCAATCAT CAAGACAACT TAATAAGTGA TGATAAATGT TCATACCATT 1260 

TTGCAAATAT AACCAGATAC TCTCAACAAA ATGTAGAGTT CAAAATTAAA AATGTCCCTA 1320 

ATTGGCACAT TGCTGTATGA TAATAAATTC AACTTAGATT TCAACAATAT AACTAATGAT 1380 

GCTTATATTT ACOGCGATAT CAAACCAAAC AGCCCATCCA CAAACAAATT AGAATATATC 1440 

TOTGTTAATAA TAGTAAATGA TATATCTTCC AAAGACCGCA AACT 1484 


(2) INFORMATION FOR SEQ ID NO: 596:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1435 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 596:
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CTTTATCaTT AAATGCCaTT TTGaATCCTT CTTGAATTTG TTTGATTTCC TTTTCAGTAT 120 

TATTAACCTT GTTCAAGACA GTGCCATTAA TTTTCTTCTT GAGTGGACCA ACCTCATCTT 180

TATTAGTTGA TTCATGAATC GTTAATCCAA TGTGTGGCTG TATTCTATAA CCATCATTCG 240

CTATAGTTGA AACATATTGT GaTAATTGTA ATGGTGTATA GGTATCATAT TGACCAATTG 300

ATAAATCTAG ATAATTACCT GGATTATTTG TTAATGGTTC GATTTGACCT CTTGTTTCAT 360

TTGGTAAATC TATCCCTGTT TTCACACCTA AGCCTACTTG ATTTAATCCT CTTCTTAGCT 420

TTTGGGCAGG TGAACTTATG TCTGAAGGTA AAGCCATACC AGAATAATAA OGGTCTCCCG 480

CTAATTTTAA TGCTGTTTTA AACATATATA CGTTTGATGA ATGCATCAAA GCTTGCTTAT 540

CATTAATAGT TACATGCCCG TTTTTATTGA AGTATGATCG TTTTGTCAAA CCACCTTGGA 600

AATGTAATGG TTCATCGACC ATTGTTTCTC CAACTTTGAT AGCTTTATTC TGATAACCGG 660

CTAATAATGT TCCACCTTTT ACAGAAGATC CAACCGCAAA TTGAGAAGTA AACGTACCAA 720

TGTCATAATC AGTCATTTTA CCACTCTTAT TAATCTGCTT TCCGGCAAGC GCAAGAATGT 780

CTCCATTTTT AGGATTTTGT ACAACCATCA TTGcATTATC CATATCTTTG GCACCTTGAC 840

TGCGAsTTCT TAATTTGTTT ATCTAATAAT GCTTCTACTT CTTTTTGAAG ATCTATATCG 900

ATCGTTAATT TCAAATCTTG ACCGCGAGCG CCAGGrTTTA ACACTTCTGA AGATGTAACT 960

TTACCAGATT TGTCCGTTGT GTATTTCATT TCTTTCTTCT TACCACGcAA TACATCTTCA 1020

TATTGATATT CTAGGTAAGA TTTTCCAACA CGATCATTGC GTGAATATCC TTTGGATAAG 1080

TAATGTTCTG TCAATTCTTT TGGaATACCT TCAGCAGGTG TCGATACATC TCCGAATATA 1140

CCTCTTAAAG TATCGCCATA TGGATATTTT CTATCCCAAT CCATAGACGT GTTAACACCT 1200

GGTAATTTGG AAAGTTGCTG AGAAACTGCT GCATACTCTT TTTCACTGAC ATCTTCATTT 1260

TTTATCATTT GTGGATCTAA AACTGTTCCT GGCATTCATC TCTCGAAAAA TAGCTAAAAC 1320

TTGGTAAATC TTTAGAAGAC AATTCATCTA ATTGTGGATT TTCCGGATTT CGGATAACAG 1380

TTGGTnTATC CATAATGGAT CCTGGTTTAA TACCTTCCAn CnGGCGAACA TAGCC 1435

(2) INFORMATION FOR SEQ ID NO: 597:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 516 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 597:

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AGAAAAAOAT TTAGAAAAGA TTACCATTCG OGACATAACA ACACGCGCTG ATATCAATAG 120 

AGGGACATTT TACTTACATT ACGAGGATAA ATATATGTTA CTCOCAGATA TGGAAC3ATGA 180

GTATATTTCA OAACTAACGA CATATACTCA ATTTQATTTG TTAOGCQOTT CTTCArTTGA 240 

AGACATTGCG AATACTTTTG TAAATAATAT ACTCAAAAAT ATTTTCCAAC ATATTCATGA 300 

TAATTTAGAG TTTTATCATA CTATTTTACA ATTGGAACGC AOGAGTCAAT TAGAATTGAA 360 

AATCAACGAA CATATTAAAA ATAATATGCA ACGTTACATT AGTATCAATC ATTCTATCGG 420 

AGGCGTTCCA GAGaTGTATT TTTACAGTTA CGTTTCCGGA GCAACAATTT CaATTAxinTA 480 

AATACTGGGG TAATGGACAA ACAGCCCATT TTCAGT 516
(2) INFORMATION FOR SEQ ID NO: 598:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1955 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 598:

AGTACTTTAC CATGAGGTAA GACTTTATTC CCTGCTTTTA TAGATTGTGC TTTTATTTGT 60

GTGCCAGAAC CAATAGtAct GGTcTAATGA TTTTGCACTC ACATTATCAA TAGCTTTTTG 120

TTTGTCTTGA CCTTCAACAT CTGGCACTTT ACTATACTCT GCATTAGATG TGTCATCTTT 180

TGATTTACCT ACATTTAAAT ATTTCAAAGT ATTTTCCATT ATTGGTTTAA ACGCTTTACT 240

AACACCTAAT TCATAAGCTT CTTGGTCATT TTTTTGTGCC AAGCTCATAC CAGCGTATAC 300

AATAACTTTA GGATTTTTCT TCGGCGCGTC ACCCATAAAA CTTACAAAAT ATGGGTTTGG 360

ACCTTTAACG TATCCACCAC CATTAGGTGC AGCGACTTGT GCTOTACCAG TCTTACCTTC 420

GACCTCATAA CCATCAATAC GATAGTTTGC AGCGTGACTC TTCTTACTAT TCACAACTAA 480

ATCCAATTGC TTTTCAACTT TTTCAGCAGT ATCTTTTGTT ATTGGTTTGC CTGCGATTTG 540

TTTTTGCCCT TTATAAAATT GTCTTTTACT AACAGGATTT TCAACGCTAT TCACAAACCA 600

TGGTTTTAAC ATATTACCAT CATTAAAGAA CGCTGATTGC GCTTGTAACA TTTGAACAGG 660

TGTTACTGTT GTCGATTGAC CAAATGATGA CGTTTTTTGt TGCAACTCAT TACTCCATCC 720

AATTTGaCCA GGTGCTTCTC CATCAAACAT ACCTTTAGTT GaTTTTCCaA ATCCaAATCG 780

TTCATACCAA GATTTCATTT TGTCTGCACC AACTAAATCT TGTAAATGCA TCATCAATGT 840

ATTAGATGAA TAAGTAAATC CGAGTGACAT TGGGATTTCA CCCCAACCGA CTCTATTCCA 900

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15 



AAAAGCACCT TCTTGAATAG CAGCTGCTAA CCCATATGAT TTAAATGTTG ATCCAGGCTC 1020

GTATGTGTTT TOATAAAGGT CATTTGCCCA CTTTTTACCA AAGTCTTTAC CAGTTTCAGG 1080

ATTAAATGTT GGTCGCTGAC TGTATGcTAA AATTTCTCCA GTTTTGGCAT CCATGACAAC 1140

CGCAAATAAA TCTTTCGOCT GGTATCTTTC AACCATGCCA TCTAAAGCTT CTTCAACAAA 1200

TACTTQAATA TTTGAATCGA TTGTTAAATG GACATCATCA CCACGTTTAG GCTGCTTCTC 1260

TTTTTTAGTA TTTGGTGCGA TATATCCCCA AATATCATGA ATATATCTCA ATGATCCTTT 1320

AGATCCACTT AAATAACTAT CAAAAATCTT TTCAACTCCA AGTGCACCTT TAAGTTCACC 1380

AGTATCCGOA TTTTTCTGAG CTCTACCAAT TAAGTGTGAT GCAAAATTGC CATTTGGATA 1440

AAAGCGTTCT GTTTCAGGCA ATAAAGAAAT ACCAGGCAAA TTCATTTTCT CTATTTTCAA 1500

TTTGTCCTGA TACGTTAAAT TTGTTCCTTT GCGTCCAAAT TCAATTTGGA AAGCTTTCTT 1560

TTGACTAAGT CTCTTTTCAA TTTCCTCTGG CTTCATATTA ATGACTGTAG ATAATTTCTT 1620

TGrCAGTCTCT TTTTTATCAA CTACATGCCT AGGTTTTTTA GAATTGGCAC 1680

ATCTATTACT GCAACAAGTT TATATCTTTC TACATCTTCT GCTAGCACTT TACCATkACG 1740

ATCATATATC TTTCCTCGTT CTGGTTGTTG TGCATTCTTA ACTAAATACT TTTCATTTGC 1800

CTTCATGACT AAATCTTGAC CATTAGAATG TCCAGTAATC ATGATATATG AAATTCTTAA 1860

AACCAATATA AAAAAGAGCA GTCCGAATAA ACCAACAAGT AGGACTGCCC CTATTTTATT 1920

rrrriTAATT TTAATTTTTT TGGTCGCCAT TACTA 1955



(2) INFORMATION FOR SEQ ID NO: 599:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 704 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 599:
TAGCAAGAGT AGGTGATAAC AGTTCTTTTA ACGTGTTTGC TGCTTGCTCA TTTCTTGGCT 60 
CATTGTAGAC TAAAATATAA TAAGCGTTTT CAAATAAGTG CTTTTTAGCA TTTAGTACAC 120

CAGATTTATG ACTACCAGCC ATTGGATGAC CACTGACTAA ATGAATATTA TGCTTTAATA 180

AATTGCATTC GTGTTGCTGT ATCATTGCTT TAGTACTACC AGTATCAGAA ACAATAACAC 240

CAGGTTTAGT TGGCATATCT ATAAGCTCGC TAAGATATTT ATTTGTGATA GCAACAGGTG 300

TTGCATAAAT AATTACATCG GCTTTTTTAA TAGCTTCACT ATAATTTAAA CATTTTTCAT 360
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TATTAGTATT AGGGTTATGG TATTTTATAT TGCTAGCAAG ACTTCCACCA ATTAATCCAA 480 
GCCCAACAAA TAAAACTGTT GTCATATAAA TCACCTTATT TCGAAATTTT CAGAATAATA 540

ACATTGTAAA TGAGCTGTTG ACACAGTGCA ATAGTAAATA AAAATCGATA ATAGCATTAA 600

TAGATAAACG GAGATAAATC ATCTACAATA AAGAGTATAG TAACACAATG GCAACGGAGG 660

GGTAAATCAA TGGAACCAAT ATTAGAAATG ATtAAAACAT TAAC 704

(2) INFORMATION FOR SEQ ID NO: 600:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1158 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 600:





CCTCTAAAGG ATCACGAACC TTTTTCATCG TGACTAAGCC TATAATACCC TTAAAcCTAT 60

TATTATTAAC TTTTACCTCT GTGTATTCCC TATCAATTAG GCGACGCCAG TGACGTTTAT 120

CTATATACTT TACTTTCACA GTCACCAACT CCTTGTCATT ATTATATAAA ACTACATAAA 180

TGATGTCATG TCATGATACA GTTTTATAGT TTTTGTTAAT CATATGAAAA TATTTATTAT 240

TATTTTACTA CAACTCGCTT CAATTTACTT AAAATAGACA ATATTAATTA GATAGTACAC 300

ACATTTCTTC ATAAAAGTGA TTTTTCAAAA ATATAAATAA CACACTCTTA TCGTTTTCAA 360

AATCATTTAA TGCTATTTTC ATTAAAAACA GCTGAAGCAT CAAATCTATT CTGATTCAAT 420

CAAGAATACA TATAAAAATG AAGTGACTCA AAGGTTTATT AACAACCTTC AAACCACTTC 480

ATTGATCACT TTTATTTTAA AGCATATTTT TCGATTACTG ATTTAAGATG CGGATATTGC 540

GTAATTAATT CAGATTGCTT AAACAATTCA AACTGCTTAA ACTCAAATGC CGGTTGACAC 600

ATACAACCTA CTAAACTAAA AGTATTTGAT ATTTCGATTG AAGAAGCAAA AATTGTTCCT 660

TTAGGCACTA CATATTGCAA TACATCTCCA TTTTGGATAT CAGTACCCAA TGTTGCAGTC 720

GTATATTCCC CATCCGGATT TATCATATGA ATTGTTAGAG AATCGCCAGC ATGATAGTAC 780

CATACTTCAT CAGCATCAAT TCGATGAAAA TGCGAAATAT TGTCATCTGT AAGTAAAAAA 840

TAAATACTAC TAAACGGCGC TCTGCGTCCA TCTTTCAATA CTTCTCGAAT TGTCTCTCTA 900

TAGAAACCAC CTTCAGGATG TGATTCAAGT TGCAATTCAT CaATCCATTG TTCTGCTGAT 960

TTCATTATTT CAAATCCACA TTATGGAATA CGTTTTGTAC ATCTTCTAAA TCTTCTAATG 1020

CATCGATTAA TTTTTCAAAT GTTACTTGGT CCGCTTCAGA AAGTTCAATA TCTGTTTGAG 1080
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GAACGACTGC AAATTGAT H58 

(2) INFORMATION FOR SEQ ID NO: 601:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 827 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 601:



TGATACTTAT TTTTCAATAT TATTTGGAAT AAAnTCTTnT AATTGTGGTG TGAAATAAAC 60

ACCGAAATTC TTTTCAGATT CATCTTTTcC ACTACCTGCA TATAAAATAC CAATCAGTyC 120

ATGTTTAGAA TTCAATACAG GAGATCCTGA ATTACCTGGT TGTGCATACG CATCAAATTC 180

CATAAACGTT CCACTGATAT GGTTAATCGT TCCTGTCGAT TCAAACATTT TATATTTTGT 240

TTGTGCACCC TTTGGATAAC CAATAACAGA AATTCTATCT TTCACTTTTG CTCCGTCTGC 300

AAATTTTGTA TAACTAACGT TCTTATTAAA ATTCAAACCT TCTGTACTTG TTTCATGAAC 360

ATGAACTATC GCAAGGTCTT CTTTTCCGGG ATATTCTACA ATGTCTTTAA CGTCGTAGTT 420

TCCTCCGCCT TTACCTTTAC TCGAATGATG TGCTGATACT CTATTTTTAA AAATATCATT 480

ACTTTTAGCG ATATGTTTGT TAGTTACGAT TGTATTTTTA CCAACAACTA CACCAGTACC 540

ACCCACAAAT GCTACCACTG aATTGTATGG TTCCTTAGTT GCATCGGTAA TTTCTTTGAC 600

ATTCTTTTCT GCTTTGGCAA TTGAATGArG CTGATTAGAA ATATTTTCAG CAAAACCAAG 660

AGATGTTAAA ATAGTTAAAG CAGTTAAACC TTTAACCATT ACATTTTTAT TCATATTTTA 720

CCTCCTTCCA AATTTATTTT nATAGTTAAT GTATCTAGAT ACACATTTAC TATATACATT 780

TTAAATAAAA ATATTnATTA AATATTAATT AACCACTTAT nAACGTT 827



(2) INFORMATION FOR SEQ ID NO: 602:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 796 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 602:
nGGnACGTAA AGTATTTATG TTAAATGGCA nAAATATATG AATGATGAGT TACCTGGATT 60

ACCAATGTTC CAAGGTAAAT CGATAACTAT TGTTAACGAT AAAGTACGAA ACTTAGACAT 120 




EP0 786 519 A2 



OGACGATTTT GTCGAGGTTT ACTTATATGG GGGGATGTTA TGAATAATGT ATTGTTAGAG 240

GTTAAAGATT TAGAAACATC ATTAAAAATA AATAATGAAT GGTTAGCAAC TGTTGAAAAT 300

ATTTCTITXU AATTATCTAA AGGAGAAGTT TTGGGTATAG TAGGGGAATC TGGTTGCGGT 360

AAGTCCATAT TAAGTAAGTC AATTATTAAA TTATTACCAG AAAAGATATC TAAACTAAGT 420

AATGGAGAAG TTATATTTGA TGGTAAACGA ATCGATACGC TCAATGAGAA GCAATTGCTA 480

GATATTOGAG GAAATGATAT TGCTATGATT TTTCAAGAAC CTATGACTGC TTTAAATCCT 540

GTATTTACCA TAAAAAATCA Aci"i\j'l\jGAA TCTATAAAAT CACATAAAAA AATTTCTAAA 600

AAAGAAGCAA ATAAATTAGC AAAAGATTTA CTAAAAAAAG TTGGAATTGC TAGACAAGAT 660

GAAATATTAA ATAGCTATCC TCATCAATTA TCTGGTGGTA TGAOACAAAG AGTAATGATT 720

GCAATGGCCA TTTCATGTTC TCcTAAATTA TTAATTGCTG aTGAACCTAC AACAGCATTG 780

GATGTCACGA TTCAAG 796



(2) INFORMATION FOR SEQ ID NO: 603:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1618 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 603:



GGATGTGTTC ATCCCAAATT GaAAcAgAaA TTTATGATCA AAtGATGTGG ATTGGTTTTA 60

AaCCATTCAa AATTaCCAAT ATTAmaCAAG AATCTGAAGA CATTAAATCA TTTtACAGTT 120

GAAACTGAAG AATATGACTT TAGTGAATTT ACACCAGGCC AATACATCAC AGTTGATGTT 180

TCTAGTGATA AACTTCCATA TAGAGCTAAA OGTCACTATT CTATCGTATC AGGTGAAAAA 240

AACCATTTAA CTTTTGGCGT TAAACGTGAT GTCACAACAG AACATGAAGG CGAAGTTTCA 300

ACAATTTTAC ACGATGAGAT TAAAGAAGGC GATATGATTA ATTTAsTGCG CCTGTAGGTG 360

GATTCGTATT AGAGAATACG ACTGAACCAC AACTTTTCTT AGGTTCAGGT ATTGGTGTTA 420

CACCTTTAGT AGCTATGTAT GAAGCTGCCT CTGCCAAAGG TTTAGATACA CAGATGGTTC 480

AAGTTGCTGA AAATGAACAA CATTTACCTT TCAAAGACAA CTTCAACAGT ATCGCAAGCC 540

ATCATGACAA CGCTAAATTA TATACACACT TAnAgATAAA CAAGGCTATA TTGGTGCTGA 600

AGAATTACAA GTATTTTTAG CAAATAAACC TGAAATTTAT ATCTGTGGTG GTACAAAATT 660

CTTACAaTCT ATGATCGAAG CACTTAAATC TTTAAATTAC GATATGGATC GCGTACACTA 720
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5 



ATATGAAACT TAATCAACGT TACGTAAAAG TATTTGCATT ATATTTOQTA AGTATTGTTA 840

CTGCAAATAT TATTGTTAAA AATAATAATT TAATTAAAAC AtTGATACAA ACCATAGCCG 900

GGTACACGGT CTTTGCAGTT GGTTTGAAGT ATTTAACTAA ACGTAAAAAT AAATGACATC 960

TATACCAAAA CAGCTATGAC TTTACTTTGT CATGGCTGTT TTTTAATTGG GAGTAGGACA 1020

GAGATGATAT TTTCGCAAAA TTTATTTGGT CGTCCCACCA CAACATGCAT TGATGTATGC 1080

TCACTGAATT TCATAAGAAA GGAGTTCACA AGATGACCGT AGATATTGGA CGGATTTATG 1140

ACAATAAAGA TAATACCGAC GCTATTCGTA TCCTAGTCGA TAGAGTCTGG CCGAGAGGTA 1200

TTTCGAAAAG AACTGCTAAC CTAGATTATT GGTTAAAAGA CATTGCCCCT TCTACTGAGT 1260

TGCGACAATG GTTCCAACAT GATCCTAAAC TTTTTGGAGC TTTTAAAGAA AAATATGAAA 1320

AAGAATTACG TGATCAGGAT GCGCAAAAAG ATGCTTTTGA AAAATTAAAG GATATTGTAA 1380

ATCAGCATAA TCATGTTCTA TTGTTATATG CAGCAAAAGA TACTAAACAT AACCAAGCTG 1440

TAGTACTACA GCAGTTGCTC AATACTTAGT TATATATGTT TATCGTGAAT CACTATAAAT 1500

GTCGCTAACT TCATTTCTAA TATTTATTTT TAAAAAGCAT AATTACTACA ATTAATTGGA 1560

ACTTTAATAA TTACTAAACT TGGAACAATA TTTTACTTTA ACAAAATAAA GTTTTAAA 1618


(2) INFORMATION FOR SEQ ID NO: 604:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2115 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 604:






ATTATAATGC AAATATTGAA GCATTTAAAA CAGTCGCTAA AGCAGTAGGC AAAGAGAAAG 60

AAGGCGAGAA GCGTCTGGAA AAGCATGATA AAATATTAGC GGAGATTAGA AAGAAAATTG 120

AACAGAGTAC GTTAAAATCT GCATTTGCAT TCGGTATCTC AAGAGCAGGT ATGTTTATTA 180

ATAATGAAGA TACATTTATG GGACAATTCT TAATTAAAAT GGGTATTCAA CCTGAaGtCA 240

mAAAArAmAA AACTACGCAT GTTGGTGAAC GCAAGGGTGG TCCTTATATA TATTTAAATA 300

ATGAAGAACT TGCCAATATC AATCCAAAAG TTATGATTTT AGCCACTGAC GGAAAAACGG 360

ACAAAAATAG AACGAAATTC ATTGATCCTG CAGTTTGGAA ATCATTAAAA GCTGTGAAAG 420

ATAACAAAGT TTATGACGTT GACCGAAATA AGTGGTTGAA ATCAAGGGGG ATTATCGCAA 480

GTGAAAGTAT GGCAGAAGAT TTAGAAAAAA TTGCAGAAAA AGCAAAATAA AAATACAGCG 540
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TGTGATGATT 


TACCTGAAAG TTTATTTCGA ATAAATTTAA TTACATAACC GACAAGGATT 660

GTTTTAACAG TTCTTTTAAT GAATTGGCGC ATCGTTACAT ACCTCATTTC TCTATATCTT 720

ACGAACTATA TACCCATTCA TATATGCTTT TTAAACGTCA TTGTCACAAT TTAATTTTTA 780

GGGAATATAA TATAACCATC TTTATCTGCT TTTTTAGTAA AAATGACAAA AATTGCATGT 840

ATTATTGaGA TGATGGTAGG GATACCTGTC CAGAAAAATA ATAAGTGAAA AAGACCTTGT 900

CCAAATTTAT CAGCATAAAA TTTATGaATA CCTAAACCTC CAAGAAATAA TGCAACmATa 960

ACATAAATGG CTTTATTGAC TTTCATTTGT AAtCCTCCTT AACTATAATT CTACTTAAAT 1020

TCGTTGTGAA AACCAATATT TCTAACTTTA GAATTTTCAA ACTTTCTAAA ATTATAAGTA 1080

TATCTTTTTA AAATAAGCTA GAATTTCTAT ATAATAAATG TTAATAACGT AAAAGGGAAT 1140

GATGACATAG TGATACGTCA AGCACGTCCA GAGGACCGAT TTGATATTGC GAAGTTAGTT 1200

TATATGGTTT GGGATGATAT GGAATTAGAA TTGGTAAAGC ATCTACCTAA AGACATGGTA 1260

TTAGATGCAA TTGAAAAAAG CTGTGTTGAT GCAACATATC GAACTTTTTA TCAGCATATT 1320

TTAGTTTATG AAGTAGAAAA TAAAGTAGCA GGTTGTATTA TTAGCTATAG TGGTGAAAAT 1380

GAATTGAAAT ACGAAAAAGC ATGGGAACTA CTTGACTTGC CAGAAAAAAT AAAACAATAT 1440

GGCACGCCAT TACCTGTAAA AGAAGCTAAA GACGATGAGT ATTATATAGA AACAATTGCG 1500

ACATTTGCAG CATATAGAGG TAGAGGCATC GCGACAAAGT TATTAACGTC ATTACTTGAA 1560

TCAAATACAC ATGTTAAATG GAGTTTGAAT TGCGATATTA ATAATGAAGC AGCATTAAAG 1620

TTATATAAAA AAGTAGGCTT TATATCTGAT GGACAGATTG AATTATACAA GCACATGTAT 1680

CATCATTTAA TTGTTAAATA AAATACTCGA CAGTTCGATG TAAGTCGATT GCCGAGTAGT 1740

ATCATTTTCT ATTAAATGCC TGCAAATAAT GCACTAATAT AAATACCTAA TGCATATAAT 1800

AAACCGAAAA ATGTATTTGT TTTACCAGCA GCAGCCATTG CTGGCATCAT TGTAGGCGGT 1860

[illegible] [illegible] [illegible] TTAACAGGCA TTGGGAATGA TAACAACGCA 1920

AGTAAGTAAA ATAATGAGCC ACCAGGTTTA ATAATGATCG TAAGTACAAT AAAGGCATAA 1980

GCGATAAAGT ACATGATTGC CATAAATGTT AAAGAAGGAT TTTTACCTAA TAGAATGGGT 2040

AAAGTTTTGC GACCACTTGC TTTATCTTTG ACACGGTCGC GAATATTGTT AGCCATATTA 2100

ATTAAACCGA TAGTG 2115



(2) INFORMATION FOR SEQ ID NO: 605:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1003 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 605:

TATGTCTCAA ACTCAAATTA ATCAAATGTT CCAGCAGAAA AACATGTCTA CTGAATTAAA 60 

ACOTCGTTAT GCACAACGTT TATTACAGTT TCCACATGTA CACAATAAAG AATACTTGAA 120 

ATCTTATGCT AAAAACCCTA AAGAAACTAA AGATAGTTAT ATTTCTGGTT TTAAAGAOAA 180 

TCAATTGATT AAAATAGAAG CGATTAAATC ATTGTTTGCA ATGGATAAAT CTCCATTAGA 240 

ACATGTTAAA CCTGCTACAA AACCAGACGC TTCTTGGGAT GAGATGAAAC AAAAAGCAGT 300 

TGAAATTGGT AAAGCTGATA CTACATCGAA TAAATTTGGT ATTAGAGATC AATACTGGAA 360 

ATTAATTCAA GAAAGTAAGC GTAAAGTTAG ACGTGACTAC GAATTCAATG TTAATTCTCC 420 

AGAATTCCAA GATTTAGAAT TACTTGTAAA AACAATGCGT GCTGCTGGTG CAGATGTTCA 480 

ATATGTAAGT ATTCCATCAA ACGGTGTATG GTATGACCAC ATTGGTATCG ATAAAGAACG 540

TCGTCAAGCA GTTTATAAAA AAATCCATTC TACTGTTGTA GATAATGGTG GTAAAATTTA 600 

CGATATGACT GATAAAGATT ATGAAAAATA TGTTATCAGT GATGCCGTAC ACATCGGTTG 660 

GAAaGGTtGG GTTTATATGG ATGAGCmAAT TGCGAAACAT ATGAAAGGTG AACCACAACC 720 

TGAAGTAGAT AAACCTAAAA nTTAAAATAC AAATAGCACA TAACTCAACG ATTTTGATTG 780

AGCGTATGTG CTATTTTTAT ATTTTAAATT TCATAGAATA GAATAGTAAT ATGTGCTTGG 840

ATATGTGGCA ATAATAAAAT AATTAATCAG ATAAaTAGTA TAAAATAACT TTCCCATCAG 900 

TCCAATTTGA CAGCGAAAAA AGACAGGTAA TAACTGATTA TAAATAATTC AGTATTCCTG 960 

TCTTTGTTGT TATTCATAAT ATGTTCTGTT AACTTAATAT CTT 1003 
(2) INFORMATION FOR SEQ ID NO: 606:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1097 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 606:

GCTnTTATAT TTAAAAAATT TATTnGCGGA TAATTACCTT TATTAAATCC CACCCATTAG 60 

GAwTaCGAGT AATAGGACTA ATACTGGAAT CACATAATGT AACATAACGT CCCTCCTTTA 120

ACTTAATTTT AATTGTAATC AAATTTGACA ATAAGTCAAA mCATTAATAC CTATGATArG 180

TATCATTTAT TAACATATGT ATCATATTTT TAATCTTGCG TAATTTTTAT CGTTAACTAT 240
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5 



CTTTTGTCAT TTTAATTATG TTAAGATAAT AGTAGTTATA GAAGTTCAAT CTATAGGAGG 360

CATAGCATGG ATATTCCAAA AATCACGACA TTTTTAATGT TTAATAACCA AGCTGAAGAA 420

GCTGTTAAAC TATACACAAG CTTATTTGAA GATAGTGAGA TTATAACAAT GGCTAAGTAT 480

GGTGAAAATG GACCTGCTGA TCCCGGGACT GTACAACACT CAATATTTAC ATTAAATGGA 540

CAAGTATTCA TGGCGATTGA TGCTAATAGT GGCACAGAAT TACCAATGAA TCCTGCGATT 600

TCATTATTTG TTACAGTAAA AGATACTATT GAAATGGAAC GACTATTTAA TGGATTAAAA 660

GATGAAGGTG CCATTTTAAT GCCAAAAACG AATATGCCAC CATACAGAGA GTTTGCTTGG 720

GTTCAAGATA AGTTTGGAGT AAGTTTTCAA TTAGCATTAC CTGaGTAAAA GGATTOGCAC 780

AGCtTGgAAT GATAAAGAAA [illegible] TATGCATGCG TTTTACCTAT GTATTTCTAT 840

TTTAAGTACA CATTAGCATT TTAGTTTCGC TCATTTTTAA AATCACAAAG ATAATTTGTA 900

ACTTAAATGA TGATACGTTA TTTAAAAACA CGATACTTCG TTTCAATGAA CGCATTAAAT 960

AATAAATAAA CACCTCACCA TAAGAAAGGA ctactttctt TTGTGAGGTG TTTatTTGTC 1020

GACAACTTGA TTATGATTGT TTCATTTTTT GAATAAGTTC ATAATCAGGT GTAGCATACA 1080

AAGTTTTTTG ATTGTCA 1097



(2) INFORMATION FOR SEQ ID NO: 607:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2031 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 607:

GAGGTCTAGC ATGTCTCGTT CAAAAAAATA CTTTTACTTA TCTAGCTTAA TGATTATTTT 60 

AAGCTTTTTC TTTAATACAA ATAACGTTTT CCTAAGTGGA CTTTTTAATT CTTTTATTAA 120

ATTAATACTT TTCTGCAGTG TTATTAACTC AATTGTACTA ATTTTGTCTA TAATTTTTGC 180

AGATOGTTCA ATTAAATCAC TAAAGCCTGA TGCAGATTGG ATTAGAATTG CGAGTAAAAG 240

TTTGCCTTGG aTTATTCTAA TTGTTATTTT AGTACATATC TTTTCAATTG TTCGTACATT 300

CGGTTTTATT TAAAAAAGTT AATATGTCAT TGTAGCCTTA ATACAAAACA ATACAATGTA 360

TCATGCTATA ATGAGTAAAA CAATTTGATA ACGTTGTTGC GTATAAAAAA TATTAGATTT 420

TCGAAATCAT AACTATGCAT CTAATCGCTA TAGTTATACA ACAAGATATA ACATATAATG 480

AGGTTTGATA ATGCATCGAC AATTTTTGTC GTCGCGTTGC CAAAACCTCT TTTTTAAATT 540
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TTTAGGCGCT TTATTACGTT ACCTCATTTC TTTTCTOAAT ACTQACOGAG GTTTTCCAAT 660 

CGGAACACTG ATAGCCAATT TGACTGGTGC CTTTGTAATG GGATTGCTAA CAGCCTTAAC 720

AATTGCATTT TTTTCAAACC ATCCGACCCT AAAAAAAGCT ATTACGACTG GTTTTCTTGG 780

TGCTTTAACG ACTTTTTCAA CATTTCAATT AGAATTAATA CATATGTTTG ATCATCAACA 840

ATTTATAACT TTACTACTAT ATGCTGTAAC AAGTTATGTC TTTGGTATTT TGTTATGTTA 900

CGTCGGTATA AAACTAGGTG GTGGTTTATC ATGATATCAA TCATTTTAGT CATGATTGGC 960

GGCGGTTTTG GCGCAATTGC TAGAAGTGCC ATTACTGATT ATTTTAATCA TAAATTTACT 1020

TCAAAGTTAC CTATCGCAAC ATTGATAGTA AATCTAGTTG GTAGTTTTTT AATTGGATTA 1080

ACTATAGGCT TATCAATTTC AATCTCATGG TTCCCTGCGT TCTTTGTTAC CGGTTTTTTA 1140

GGTGGCTTAA CAACTTTCTC AACGTTAGCC AAAGAACTTA CACTAATGAT GACGCCAAAA 1200

TTTAATATTA ACCTTTTTCT CAATTATTCA CTTTTACAAT TCATCATTGG ATTTATAGCT 1260

TGTTATATTG GCTATCATAT TTAAAAATAA AATGCTTCAT TCAGCAAATA GGTAAATTAC 1320

GACACCTTCC TGAACGAAGC ATTTTTTAAT TTTCATGCAA ATTTTTAAGC ACCATATAAT 1380

GCCTACCAAA TTTCAATAAT CTTTGTTGCC GTTTAAATAA TGTGAATGTC AATAAATTCT 1440

CCAAACTAGT CGAAAATAAA GGGAGTGGGA CATAAATCCC TAAAAAAACA GCAGTAAGAT 1500

AATTTTCAAT TAGAAAATAT CTTACTGCTG TTCTCTATTT ATACAATACT TCGTATTGAa 1560

TGGCTTCGCT TTCCTAGGGT GCCGTCTCAG CCTCGGcTTC GACTGGCACT GCTCCCTCAG 1620

GAGTCTCGCC ATTAATAcTA CGTATTAACA TGTAATTTTA CTTTTACATA CTTTAAAAAA 1680

TAAGACACTT TGCCCAACTT ACACTACCAA TAGAAACCTC TGTTAGAATT CCTCAAAATG 1740

ATATTTCGCG ATATGTTAAT GAAATTGTTG AAACGATACC TGATAGCGAA TTCGATGAAT 1800

TCAGACATCA TCGTGGCGCA ACATCCTATC ATCCAAAAAT GATGTTAAAA ATCATCTTAT 1860

ATGCATATAC TCAATCTGTA TTTTCTGGTC GAAGAATAGA GAAATTACTT CATGACAGTA 1920

TTCGAATGAT GTGGTTAGCT CAAAATCAAA CACCTTCTTA TAAAACTATT AATCGTTTTA 1980

GAGTGAATCC TAATACTGAT GCGTTAATTG AATCTTTATT TATTCAGTTC C 2031

(2) INFORMATION FOR SEQ ID NO: 608:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 687 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

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TCCCCGGGAA TCGAACCCGC GAnCTCCTGC GTGACATGCA GGCGTGTTAA CCGCTACACT 60 

AOGAGACCTA TTAnATTAAA AACTATGTAT TGOGGGAGGC GGATTTOAAC CACCGACCTT 120 

OGGGTTATGA GCCCGACGAG CTACCGAACT GCTCCATCCC GCGCTAATAT TATTTTGAAT 180

TACCTAATTA ATATACCATA ATCAAAAACC TAAAGTsrrG AACTTTTTQA ATTTAATTTA 240

AATGTTATCT CTTAAATAAT TACTTAAATA TCGTAGCAAC ATGTTCTCTG TTGAACACAA 300

ATATTAGTAT ATTCATTTTT GTAGTGTACG TCAACGACAT TTTCAAAGTT TTTTGTGTAA 360

AAAAACGCTT CTTATTCCCT TTTATCATAT AAGTGTCTAA TAGTTGTCAT AAATAGTGkT 420

AAAGCATTTA AAAAGGTATA GGAGTTATAA AGTTEACAAC GCCTATACCT TCTGAAAAAG 480

AAATTATAAA AGCTTGTTAC ACCGCATATT CTTTCAGTCA GGGACTACCA ATATAACATT 540

GTAGCCCtAA GACATTGCTT GACGCCTCAn TTACAACAAT TTTTCAAAAT CAGCAGCTAC 600

CTACTGACAC AACATAACAC AACCCnTACA CTAnCTATCG TGTCATGTAA TCTTGCATCC 660

GATCTTGCAA CGCTGTAAAT GTTTCGA 687

(2) INFORMATION FOR SEQ ID NO: 609:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 843 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 609:

GTGTTGAATA CTTTGACAAA ATGAAATCAT TTGTTCTTTC GTTTCAAATT TAACTGTTTG 60 

AATTAAATCA GTTCGTTTTT CGTAGTACTT CGGCGTTGTG TTCATATTCA TTTTTTCTAA 120 

AAATAAACTA GTAAACAATG CACCTTTAAG ACTCTGACTG ACAACGTGTG GTGCTAAAAA 180 

GAAACCTTGA TACATTTCAA GCAATGCATT TAATGATGCA CCCGCTTCTT TACCAATACC 240 

AGGTGCTGTC AATCTATAAC CACATCGTTC AATTAAATCT TTTCTACCAG CAATGTATCC 300 

ACCAATCTTA GCTAAACCAC CGCCAGGGTT TTTAATTAAT GATCCTGCTA TTAAATCGGC 360

ACCACATTCT ATAGGTTCAC GTCTTTCAAC AAATTCCCCA TAACAGTTAT CCACAAATAT 420 

TAAAATATTA GGATGCACGT TTTTCAACCT AGTAATTACC TTTTCAATTT CATCTAGCGG 480 

AATTGAAGGT CTTTGATCAT AGCCTTTCGA ACGTTGAATC GCTATTACTT TGGTGCGCTC 540 

AGAAACCCCa TCTAACACAC TTTCAATATC GATCTTACCT TCTTTAAGTG CAATATCTTT 600 

ATACGATmCG CCATGCTCCA TTAAACTTTC AATACCATTT CCGTTTACGC CAATGACTTC 660 
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ACTTTGTAAT GCAATAGTAA TCGCATGCGT ACCTGAAATA ATTTGCGGAC GAACAATTGC 780 
ATCTTCTGCT TTAAATGCCT GCGCATATAT TTCTTAATTG AACGAATTTA GAACTAATCG 840 

TAA 843 

(2) INFORMATION FOR SEQ ID NO: 610:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 775 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 610:



GATTATTATA AACCGAATGA AACAAAAAAG TTTTTACATA TTTTCAACAA CTTTATTTCA 60

ATAAAGCATC AAGTTGATCT AAAGTTTGAT TCATACCTTG TTCAACACCC ATGTTTATGA 120

CTTGTTGAGC GGCTTCTTTT GTTGGAAATA CCGATGTTGA TGTCACTGTC GTTTTCGTTG 180

TATTGCTCTT GGAAAATGAC AAAGTAATTT TCATGCTTGG CATTTTTGTA TCTTTTGCAC 240

CTTGAGGTGT CGCAAATGAG TCAATATATT CAATTAAATA TGGACGCTTA ACCGTTTTAT 300

ATTCTGCTAA TGTATAGCTG GTCATTGTAG GTGTTTTAAT CGCATAAAAT GCATCACCGC 360

CTGAAACAGC ATTAAAGCGA AACACTTTGG TACTAGCGTC TTTTGGAtGA AACCATTTTT 420

CAAATAATGC TTTkGTCGTA TAGGCATCAA ACACCTTTTC AATCGGTGCT TCTATCGTTC 480

TTGAAAAAAT AATTTTATTA TCTTCAACTT TAATCGTCAT CTTCCCACTC CCTATCTTTT 540

ATATTTACTC ATCTTAACAT GCATTGAGTA ACATATGTTA CCAAATCATT TGTTATAAAC 600

TATAGTAATA TTAATCATTC CCTCTGTnGA AATTGATAAC TACGAAATAA AAAACACACT 660

CTATTCAGCT ATTAACAGTT GAGTAGAGTG TGCAAACCTG TTGAATTAAT GCGCTAACAT 720

TTCTTCTTTG ATTTGGTCTT nATnTAATTT TGAAGGATAA TATGTTGGCC AGTTA 775



(2) INFORMATION FOR SEQ ID NO: 611:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 728 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 611:
ATCAAAATTT GATACTAAAG GGCTCACGCT ATTTTATGTA TTTATTGGAT TGATTATGTT 60 

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TGTGGCAATG TGTTTATTTA AAGTAGAAAA ACATGTTTCT TCACCATTTT TACCTGTGGT 180 

TGAATTTAAT CGTTCGATTA CTTTAGTTTT TATAACTGAC CTTCTAACAG CTATTTGTTT 240 

AATGGGATTC AATTTATATA TTCCAGTCTA CCTTCAAGAA CAACTAGGAT TATCTCCATT 300 

GCAAAGTGGA TTGGTTATTT TTCCTTTATC TGTAGCTTOG ATTACATTGA ATTTTAATTT 360 

ACaTCOAATT GAAGCAAAAC TATCAAGGAA AGTTATTTAC TTACTATCAT TTACATTGCT 420 

ACTAGTAAGT AGTATTATCA TTTCATTTGG TATTAAATTG CCGGTACTTA TAGCATTTGT 480 

GTTAATTTTG GCAGGATTAA GTTTTGGATA TATTTATACG AAAGATAOTG TGATTGTCCA 540 

AGAGGAAACT AGCCCATTAC AAATGAAGAA AATGATGTCA TTTTATGGAT TAACTAAAAA 600

TCTTGGGGCA TCAATAGGTT CAACAATAAT GGGATATCTT TATGCGATAC AATCAGGAAT 660 

CTTTGGTCCA AACTTACACA nTGTGTTAAG TGCTGTTGCT GTAATTAGCA TGGGCTTATn 720 

GTTTTATG 728 

(2) INFORMATION FOR SEQ ID NO: 612:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 913 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 612:



TATCAGAGCA AATAAGCAAG TTAGATAAAA AGATTTCTGA CTTACAATTA ATTAGACGCT 60

CTGTATGTGA ATTTATTAAA GGACTCTCTC TAATAGATAC CAGCATTTTA AACAAGACAC 120

TACAGTCACA ATATGATAAA GAAGCATCTA TAAAATATGG TCATACGAAA GCATATCAnT 180

CATTTATTAG ACGTAAAGAC AGCTTACAAT CGCAGGATAT CAGACATAAA TTGACAACTA 240

TCTTCAATAA ATTTAATCAT ATGTCTTTGA GTCATTATCC AATCCAAGAT TGTAGTGATC 300

TCGTATTTGA GTGGAAGGCA TTTATGAACA CTATCGCTGA TTTTGmTGAT GAAACATTAT 360

GCTGTATTGC TAAAACATAT GAAGATGATA CGCGTTTCAA AGATTACTTT AATTCATATG 420

ATAATCAAAA TTTAGCATCA TACATTTCAG AAGCTGTTAA TTATTTTTTG AGCAATGTGA 480

ATAAGAGCGA CAATTTTTAA TCCTCATACA GATGCTACAA CACTGAATAC ACCAACTAAA 540

ATAATTGTTC AAAGTAGAAA ACTAATCATT TATTCTCTAA ATCAATAACT ATATTAAAAG 600

TTATACCTTT GCAAAGCGAA TTAGTATAGG TTACCGAAAG GAGAAAGGAT TAGGTTCCAT 660

TCGATTTATA AAAAAATATA TTTCGAAACA TACAAAAGCG CCAGAAGATA ATCATTTCCT 720

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TCAATGCATA TTCAACTAGC TCAGGTTTAC TCTTTAAGCC AAOCTTTGTC ATAATATGCG 



840 



TCTTATOTOC TTCTACTGTT TTCACAGATA CAAATAATTT CTCTGCAATT tCTTTATTCC 900

CGTAACCTTT GGC 913



(2) INFORMATION FOR SEQ ID NO: 613:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 654 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 613: 

ACAAGTGTTG ATGTCATAGT AGCGTCAAwT GTGCTTTATT TCTGGGACAC ATTTAAATTT 60 

TTCCACCCCA TTACTTGGAG CTTAATCTTT ATTACAATTT TACTATTATT AAACATTTTT 120 

TCTGTAAAAT CATTTGGAGA AACTGAGTTT TGGTTATCAT TGATTAAAGT GTTAACAATT 180 

ATCGTATTCG TTATTTTTGG CTTTTTAATG ATTTTCGGTA TCTTAGGTGG TCATACATAT 240

GGATTTGAAA ACTATACAAA AGGCCAAGCA CCGTTTGTTG GTGGTATCTC TGGTTTCTTA 300 

GGCGTATTAT TAGTCGCCGG ATTTTCGGTT GGTGGTACAG AAGTAGTAGC AGTAACTGCT 360 

GGTGAATCAG ATGACCCTAA AAAGTCTATG CCTAAGGCAA TTAAACAAGT ATTTTGGCGT 420

ATTCTTTTAT TCTATGTCTT ATCAATTGCA GTAATTGGTG CAATTATTCC GTACACAGAT 480 

CCATCATTAT TAAGAGCAAG TAGTTCAATA AGTCAAAGCC CATTTACAAT TGTATTCGAT 540 

AGAGtAGgCA TAGCCTTTGC AGCATCAGTA ATCAACGCGG TTATTTTAAC TTCATTATTA 600 

TCCGCTGCAA ATTCAGGTGT TTATACAACA GGCAGAATGT TGTATTCCTT AAGT 654 
(2) INFORMATION FOR SEQ ID NO: 614: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 614: 

TCGATCTTTA TTGATAATAA TTAAATTGTC GCCTTTAAAA TGTGATATTA ATCCTGCGGC 60 

AGGnTGTACA ACGAGTGATG AACCTAGTAC AACAAGGGTG TCAGCATGTT CAATTTTATT 120 

TAATGCCCTT ATGATGGTAG GTTGATCTAA CATTTCACCG TATAATACGA TGTCCGGTCG 180 
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ATAAGATTTA TGACATACAT TACAATAAAA ACGATTTAAC GTGCCATGTA ATTCATCAAC 300 

ATGTTGACTT CCAGCGTCTG AGTGCAAACC ATCGATATTT TGCGTGATGA CACCTAAAGA 360 


TTGTTGATTA CGTTCTAATT TTGCAATCCA ATCATGAACG ATATTGGGCA TCGTATOGAC 420 

AAATAGTAAG CGCTTATGGC AGAAATTGAT AAAACCTTCA GGATCATCTT CTAAATAATC 480 

ACGGCTTAAC AAGTATTCTG GCGAAAGCCC ATCTTTTGaA ATTTCATCAA ATAAGCCACC 540 


CATTGaACGG AAATCTGGAA CGCCACTTGC GACAGATACA CCAGCACCTG TAAAAAATOT 600 

AATACGATTC GAACTATCTA TAATATGTTT TAGTGTCTCT AA 642 

(2) INFORMATION FOR SEQ ID NO: 615:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1210 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 615:


TTTCCCCnCC CCnCCAAATA TCCAAnGGAA CTTTAATAGT CCAATTGGCA CAGTAAAACT 60

ATGGCATTTT ATAAGTATAA TATATCTGTA ATTTATGGTC AATTAGTAAA TTGTTTTTTA 120

TTTGAAACAT ATTTCACATc AAAATCACAA AGACTTTTAG ATTTTGtTCT AAAAATCTCT 180

TAATAATTTA TTTAATGAGA AGAGTTGCTT ATATAGTAAA TTGTGAAGCC GTTAAAACAA 240

CGTTACAAAA CCTATATCTT TAATACGGAA CCATATGGTA TGAATCAAGG AATACTTAAA 300

CTAAAACTTC TCTATCAGAT TTA'ITTGTTG CGAAATCAAC AACTTTAATT GCTTGCCCTT 360

CATTTAATGG ATAATTTGCT TGCGTAATTT TAACTTTTAC AATTTGACCT ATGAGTGATT 420

CGTCACCTTC AAATTGTACT TTCATATAAT TATCTGCATA TCCAACTAAT GTACCTTCTG 480

TGtCACCCTG TTCCTCAGGA ATTACTTCAA GCACATCTTG ATCAAATTTA GACGCATATA 540

ACTTTCCGAG TTGATTGCTT AGCGTAATTA ACTTATGCAC CCGTTCATTT TTAATTTCTT 600

CATCAATTTG GTCATCCATT CTTGCAGCTG GCGTGCCAAT TCTAGGAGAA TAAGGGAAAA 660

CATGCAGTTC AGAGAACTTA TGCTTTACGA TAAAATCATA TGTTTCTTGG AACTCAGCTT 720

CAGTTTCACC TGGGAAACCA ACAATTACAT CACTCGTAAy TGCCAAGTCT GGTAAAGCTT 780

TATGCAATTT TGTTAATCGT TCTGAAAATC TATCCATTGT ATACTTACGT CTCATACGTT 840

TTAATACTGT ATCTGAACCA GATTGTAATG GAATATGCAA ATGACGCACA ACTTTTGTTG 900

AACGTTCTAA AACGTCAATT ACTTCATCTG TAAGTTGACT TGCTTCAATT GAAGAAATTC 960
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20 






CTTTTAAATC TTGACCATAT CCACCTGTAT GAATTCCCGT CAATACAATT TCCTTATATC 1080 

CTGAATTCAC TAGTTGCGTC GCTTGTTCAA CTACTTTTTC CGGATCTCTT OAACGCATTA 1140 

AGCCACGAGC CCATGGAATA ATAcAGAATG TGCAGAAGTT GTTACAACCT TCTTGAATTT 1200 

TTAATGACGC 1210 
(2) INFORMATION FOR SEQ ID NO: 616:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 652 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 616:

TAATAAAATA CAATACTTTT CAATACAGAG AATCCCGCAA TGTCGGGATT CTTATTTATG 60 

CTGATTTTGT TTTTGTCTAT GATCAGGGAC TTtCAGGGAC TCAATTAATT ATCACATTCA 120 

TATTTTCTTA TGACCAAATT GATACATATG ATGCTATTTA ATGAGAAAAT TAGGCATCAC 180 

TTGGTTATTG AATTTCCTTC ATTAAACTTT CCAGCTCAAT TTAATAGTTA GTCGACTATT 240 

ATTCATTAAA CACTTTTTAA TCATAAAAAA GTGTTTTTGa TAATTCACTa CCaAAAACAC 300 

CTTCTTACTT ATAATTCTAT TTGTTCACCA TTTCTAATCT TATCGGCTAA ATCATTCAGT 360

TTTCTTAATC GGTGATTTAC ACCTGATTTT GAAATTGGAC CAGTTGATAC CATTTCTCCA 420

AGCTCTTTCA ACGAAATTTC TTGATGTTCT ACTCGAATTC TAGCAATCTC TCTCAACCTG 480

TCTGGTAAAT TTTCAATACC AATTTCTTTA TCAATCAATT TAATGCTCTC AACTTGTTTC 540

ATCGCAGCAC TAACTGTTTT ATTTAGATTG GcCGTTTCAC AATTAACGAG TCGGTTAACA 600 

GAATTACGCA TATCTCTTAC AATACGTACG TCTTCAAATT TTAATAAOGC TT 652 


(2) INFORMATION FOR SEQ ID NO: 617:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 798 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 617:
TnACCACTTT AAAATAGCGC TTAATAAAtG AAGGGGGCAA GTCTTATGAC GtTTTACaAT 60 
TTCATCaTGG GTTTTCAAAA TGrTAACACA CCATTTGGTA TATTGGCCGa ACACGTTAGT 120
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TCTAATTACA 


CAGATCATCA ATTAATTGAA ACTACAAATA GAGCTATTAG CTTATATATG 240

GCAAATTAAT TTGAGTAGTA CCAATTATGA TGTATTAGTG CATCCCAAAT ATCTTTTGTT 300

TTAAAGTTTA TTTCATCATT TCTTATCGAA AATGGTGTAA TAATGTCTTT ATCTAACCAA 360

GTOTTGATAA GTTCATTTGG TACACCATCT AACAACATTT CACTTTTACT AATTATAAAA 420

CATTCCCAGT CAAGTGAAAC ATTTTGTGGA TTCACATAAT TACATTGATT ATGATTATCC 480

ATAAACACTC ACTCCTTTAA AATTCTGTAC TCTTCATTGC GTTTTACCCC GTCACATTAT 540

CTTTTAAACT AAAATCATCA TTACTTATGA AAAAAATGTA CATCAAAAGC AAAGGTTTTC 600

GCTACCGAAA AAGTTTAAAT AATGGTttAA TATATttGGT ACTCATTTTA ATAAAAAGAG 660

AATACATTTT GAGCTATCAA TACTTTTTAT TGAAGAGGTG TTATTsyTGG CTAAAACGTT 720

ATATTTAATG CGCCACGGAC AAACTTTGTT TAATTTTAAG GGACTAATTc AGGGATTTGG 780

AGATTCGCCG CTAACAGA 798




(2) INFORMATION FOR SEQ ID NO: 618:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1786 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 618:








ATACGCTCAA TTGATAAAAA TTTAATATCT GCCATTTGTA TACGCATCGC TTCGAATGTT 60

TCCGTTGCAA TATCAAATGA TAATTGTGAC TGGAACCTTA AACATCGAAT CATACGTAAA 120

GCATCTTCTT GGAATCGTTC CTCAGCTATA CCTACAGTTC TTATTATTCG ATTATTAATA 180

TCTTGTTGAC CATCAAAATA ATCATACAAT TTGTATGCTG TATCCATTGC TATCGCATTC 240

ATCGTGAAAT CTCGTCGTTG CAAATCTTCG TATAAATCAC GAACAAATGT AACACCACTT 300

GGTCTACGGT GATCGACATA ATCTTCTTCA GCCCGGAATG TTGTCACTTC ATAATTTtCA 360

TCATtAAAaa CTACATTtAT CGTGCCAtGT yCTTnACCTA CAGGTATCGT ATGACTAAAG 420

ATAGATTCTA TTTCATCCGG CGTTGCACTT GTTGTGATAT CTATATCATG AATATTTCTT 480

CCCATGACAT AATCTCTTAC AGAGCCACCT ACATAATATG CTTCAAAACC ATTGTCTTGA 540

ATTTGTTCTA ATATAGGCCT TGCCTGTTCA AATAATGATT TATCCATATT ATTACTCGCC 600

TTTACTTTTG TTATGCTCAT TTAGCATTTT TTGATAATAA TACTCATATT GATCTGTAAT 660

AAGTTCTGAT CCAAAACGTT CAGCAATATC TGCTAGCATG TTTTTCTGAA GTTTGTTGTA 720
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ATCTACGACA 


AATCCAGTTT CACCATGTTT AATAACCTCT TTAATTCCAC COGCATTTGA 840

ACCAATTGGA ACGACGCCTG TwTTCATAGC CTCAAGTAAA GTTAGTCCAA AGCTTTCTTT 900

TTCACTTAAT AATAATACTA AGTCAGATAA TTGGTAAAAT TCACTTACGC AATCTTGTTT 960

CCCTAAAAAT AAAACATCCT CTTCTACGTT TAACTCTTTC GTCAATTGAC GCATTGGCAC 1020

TAATTCAGGA CCATCTCCAA GTAAAATTAA TTTACTAGGT ATCTTTTCAC GTACTTTTGC 1080

AAATGTTTCT ATAATAGTAT CTATGCGTTT TACTTGTCTA AAA1TCGATA CATGTATTAA 1140

CACTTTTTCA TCTGGTGCTA TACCAAATTG TGATTTTAAT GCTGTGTTAT GTTTAGTTGG 1200

AAACTCATTT TCACGTACAA AATTATAAAT CGGTATAATT TCTTTGTTAG TTTCAATAAT 1260

TTCATGTGTT TCTTGTGCTA AAGATTTACT CACACTTGTC ACAATATCAC TTTTTTCAAT 1320

GCCAAATTTA ATTGCACCTT GGAGTGAATG ATCATAGCCC AAAACAGTAA TATCAGTACC 1380

GTGTAGCGTT GTCATAATTT TTATATCTTT ACCTGACATC TCACGAGCTA AAATCCCACA 1440

AATTGCATGA GGTACAGCAT AGTGCATATG CAACAAATCA AGATCATATT CTTTAATAAC 1500

TTCAGCGATT TTAGTACTTA ACGTAATATC ATACGGTGGA TACTGAAATA CTGCATATTG 1560

ATTCACTTCA ACTTGATGAA AAATCATATT CGGTAATGGT TTTCTTATTC TAAACGGGAT 1620

ATTTGAAGTG ATAAAATGTA CTTCGTGACC TCGCTCTGCT AATTTAATTC CTAATTCTGT 1680

GGCAATAATT CCAGAACCAC CCATGGACGG GTAACATGTT ATACCTATCn TCATTCGCTT 1740

GCCCATCCTT TCTTTCTATT TCTCnCTATG ATnCTCGATG CGTAGA 1786



(2) INFORMATION FOR SEQ ID NO: 619:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 844 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 619:
ACAGGTATGG ACTTnGCTCA AATGACACGA CATTATTTAT CAAGACCTAT TGCTATAATC 60

TTTTGGATCA TTGCAGAACT AGCAATTATC GCTACAGATA TTGCTGAGGT TATTGGTAGT 120

GCTATTGCTC TTAATCTCCT ATTTAACATA CCTTTAATCG TCGGTGCACT AATAACTGTA 180

CTTGATGTAT TTTTACTACT TTTTATAATG AAATATGGTT TTAGAAAAAT TGAAGCTATT 240

GTTGGTACAT TAATTTTCAC AGTGTTATTC ATCTTTATAT TTGAAGTCTA TATTTCATCA 300

CCACAGTTGA ATGCTGTGTT AAATGGATTT ATACCACATA GTGAAATCAT TACAAATAAC 360
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TTACATTCAT CAATTGTACA ATCTAGAACA TACTCAAGAC ATAACAATGA AGAAAAAGCG 
CAAGGaTTAA ATTTGCTACG ATAGATTCGA ACATTCAGTT ATCAATCGCA TTTGTAGTCA 
ATTGCTTATT ATTAGTGtTA GGAGCATCAC TATTTTJcCAA CTCAAATGCT GACGATTTAG 
GTGGTTTCTA TGATTTATAT CACGCCTTAA AAACTGAACC TGTACTAGGT GCAACAATGG 
GTGCAATCAT GAGTACATTA TTTGCAGTTG CATTATTAGC GTCAGGTCAA AATTCAACGA 
TTACTGGTAC TTTAGCAGGA CAAATTGTAA TGGAAGGATT TTTAAGATTA CACATACCAA 
ATTGGTTAAG AOGTTTAATT ACACGTTCTC TTGCTGTCAT TCCTGTTATC GTATGCTTAA
TCAT 

(2) INFORMATION FOR SEQ ID NO: 620: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 574 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 620:
TGTATTGCAG TCATGCCATA AATTTTTTGA ATAATAAACG GAGACGCAGA AATATAAGTA
AATAATATTA CAAATGTCAT ACCTTGGAAT GAGCATTGGT AATACAAAAC GTGGCGTCTT 
CAnTAATATT TTGAAGTTTT TAAACATTGT CTTTAaTCCA CTACTTGACT CACGATTTGT 
CACTGTTAAT GATTCAGGTA CTTTTAATAA AGAACCTATG ACCATGACGA AGCCAAAGAT 
AGTCAGAATG ACAAAGACCA TACGCCAGAC AGAATAATTT AAAATTATGC CCCCTATTGT 
TGGCGCAACA ACTGGTGCAA TACCATTAAC AAGCATCAAT AATGCCATAA ACTTAGTTAG 
TTCATTACCA CTATACATAT CACTTGCTAT AGCTCTTGAA ATAACTGCTG tGCGCCACCT 
GTCACTCCTT GAAGaAATCT TAATGCAACC ATCAGCCAAA TATTATGTAC AAAAACAATA 

CCTAAACTTG CTAATGTAAA AATAATCATA GCTATAATAA GCGGCTTTCT GCGCCCAGTT 

GAATCTGAAA TTGGACnAGC AACCAAATTA CCAA 

(2) INFORMATION FOR SEQ ID NO: 621: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 523 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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AAGCAAGAGA GAGTACAACA TTTATATGAT ATTAAAGACT TACATCGATA CTACTCATCA €0 
GAAAGTTTTG AATTCAGTAA TATTAGTGGT AAGGTTGAAA ATTATAACGG TTCTAACGTT 120 


GTACGCTTTA ACCAAGAAAA TCAAAATCAC CAATTATTCT TATTAGGTAA AGATAAAGAG 180 
AAATATAAAG AAGGCATTGA AGGCAAAGAT GTCTTTGTGG TAAAAGAATT AATTGATCCA 240 

AACGGTAGAT TATCTACTGT TGGTGGTGTG ACTAAGAAAA ATAACAAATC TTCTGAAACT 300
AATACACATT TATTTGTTAA TAAAGTGTAT GGCGGAAATT TAGATGCATC AATTGACTCA 360 
TTTTCAATTA ATAAAGAAGA AGTTTCACTG AAAGAACTTG aTTTCAAAAT TAGACAACAT 420 

TTAGTTAAAA ATTATGGTTT ATATAAAGGT ACGACTAAAT ACGGTAAGaT CACTATCAAT 480
TTGAAAGATG GAGAAAAGCA AGAAATTGAT TTAGGTGATA AAT 523 
(2) INFORMATION FOR SEQ ID NO: 622:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1871 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 622:



AGAAGTGTGA nAAAAATTTA AnAGAGATAT GCACATAGAT GACGCATTGC TATATCCAAG 60

CAATTGAGAA AGCTGCTGAT GCTCCAAATC aCGGAATGAG GGAACCATGG AGAGTTGTGC 120

ATGTTCCGAA AGACAGATTA GGAGATATGA GTAAGGATAT TTCTAAATTT GCATTTCCTA 180

ATGAATTAGA TAAGCAACAA TGTCATTATG ATGCAGTTAC GAAACTAGGT GGCATGTTAT 240

TGCTTATTTT AAAAACAGAT CCAAGACAAC GTCAAAATGA TGAAAACTAC TTTGCATTTG 300

tGCATATGCA CAAAATCTTA TGTTGTTACT TTATGAAGCG GGAATAGGTA CATGTTGGAA 360

aTCGCCATTA TATATCTATG ATCCTAAAGT AAGnAAACAC TTGGTATAAA GnAAGATGAA 420

GTTCTTGCTG GATTCTTATA TTTAACGGAT TTAGAAGnAG ATATGCCTAA AGCACCACGT 480

AAAAATAGAA ACTTAATTAC ATTATATTAA TATGTATAAT TATAGAAACA TTAATAAAAG 540

CTGAGTCATG AATTGATGGA CATCTATCGA GTTAGAGATT TAATCTAACT TACTAGAGTC 600

GGTACAATAA CAGTCTCAGC TTTTTATTGT GCAGTATATA CACATTTTTA TTTTAGTATT 660

TATTtAAAAG TTtCTGCTaA AAATGATTCA ACTTGTTCAG GTGACTTAGC ATTTGCTGAA 720

TGAAGGTGTG CAATTTTATC GCCGTTTTTA AATACTAGCA AGCTAGGGAT ACCCATAACT 780

TCATTTTCAA CAACTACATC TTCTAATTCA TCACGATTAA CAGTATACCA TTGGTAATCA 840
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CCTGCCTCAA ATTTAACAAT 


TACAGGTGTA TCGCTATTAA TTACAGATTT AAATGATTCA 960

TTACTTTTGA TTGATTGCAT TGTAACAACT CCTCTAGATA GTTTAATAAT TTTTATTATA 1020

GCTAAATTTA TATCATAATA AAAAATTTTA GCTTCAAAAT GAAAGCCTTT TGCTTTGGAA 1080

AATGATATAT TTATTTTAAA TACATAAAGG AGGTTGCAGT CGTATGATTA AATTTTACCA 1140

ATATAAGAAT TGTACAACTT GTAAAAAGGC AGCAAAGTTT TTAGATGAAT aTGGCGTAAG 1200

TTATGAACCA ATTGATATCG TTCAACATAC ACCTACAATA AATGAATTTA AAACAATAAT 1260

TGCAAATACA GGCGTAGAAA TTAATAAATT GTTTAATACA CACGGCGCGA AATATCGTVaA 1320

GCTTGATTTG AAAAATAAAT TACAAACTTT ATCAGATGAT GAAAAGTTAG AGTTGTTATC 1380

ATCTGATGGT ATGTTAGTAA AGCGTCCTCT AGCAGTAATG GGCGATAAGA TAACATTAGG 1440

ATTTAAAGAA GATCAATATA AAGAGACTTG GTTAGCGTAA GTGaAATGTA AGCGTTTACT 1500

AAATATCTCG ATATTTAGAT TCATTACATG TAAAATGAAA TAAGCTATAC AATTGTTAAT 1560

TTTTATAAAT ATAGTTGAAT AGCATCTAGC CTTATGGCAT CATTAATGAT GTAAAGATTA 1620

ATTAGGAGGG GATTCTCTTG GCAGTACCAA ATGAAyTGAA ATATTCAAAA GAGCATGAAT 1680

GGGTTAAAGT TGAAGGTAAT GTAGCAATAA TTGGAATCAC AGAATACGCA CAAAGCGAGT 1740

TAGGTGATAT TGTTTTCGTT GAATTACCAG AAACAGATGA TGAAATTAAT GAAGGGGATA 1800

CGTTTGGTAG CGTAGAATCA GTTAAAACTG TATCAGAATT ATATGCACCA ATCTCTGGTA 1860

AAGTAGTTnA A 1871



(2) INFORMATION FOR SEQ ID NO: 623:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 451 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 623:

TTGCTGTAGA AGATAAAGAG TCCCATCAAT GGATAGGCTT TATAGGTTTG AATTATATTC 60

CAGAAACAAG CGATTATCCA TTTAAAGAAT TACCGCTTTA TGAAATAGGT TGGCGCTTGT 120 
TGCCAGAATT TTGGGGAAAA GGATTAGCAA CTGAAGGCGC AAAGGCAACA TTGAAGTTAG 180 

CAGAAGAACA TCAAATATAC GATGTCTATA GTTTTACAGC AGAAGCAAAT AAAGCTTCAC 240

AACGTGTAAT GGAAAAAATT GGCATGACAG TGTATGATCA TTTCGAATTA CCCAATCTAA 300 
GTAAGTATCA TTTATTaAAA AGGcAAGTGC GCTATTACAT TAATCTTCcG AAAGTGGAAA 360 
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ATTAgGGGGT GtTTnGTkTn ATTTTTTTAA n 451 
(2) INFORMATION FOR SEQ ID NO: 624: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 665 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 624: 

CGATAGTTAT CTAAAGCCAT TTTnGATTGT GTTATGAAAT CTAATGATGC GTGATAATTT 60 

AATGCrACAT AACGrTaATA TAAAATATCA ATAGTGAACA TTTGAGCAAA TAATGAAGTT 120 

GTTGCTCCCA TATGCATCTC ATTTTCATCA GTTTTCCCAT AAGTTAAAAC AATATTTGAT 180 

GCCTGTGCTA CGGGATTATC CCTTGTACTA GTAATTGTAA TTATAGGTAT ATGGTAGTCA 240

TCAATAACTT TAACCATTGA TTGCATTTCA CTTTGCGTAC CATTGTTAGT AATAAGAATA 300 

aCACTGTCGT TCGAATTGTG AGTTGCTAAT AATGTAGCAA AAATATGTGT TTCTTGAACA 360 

AGTTGAATAT TAAGACCTAT TCTTGATAAC TTTTGGTATA AGTCGGTAGC AACTACAAAA 420 

GATGCGCCAA AACCATATAT AAAAATCGTC TCAGAACGTT TTAAACAATG ACATATTTGA 480 

TCAATAGTTT TATCATTTAA TTCGTTATTT GCATGATTAA GTGCGCGTGT AGTACGTGTA 540 

TGGAGTTTAG TTCTTAAAGA TTCTGTGCTT TCGTTATTCA TTAATTCAAC ATTGTAAATT 600 

GATGATGCTT TAGGAACATA TTTAGATATA TTTATTTTCA AGTCGTGAAA ACCGCCATCA 660 

GTAAT 665
(2) INFORMATION FOR SEQ ID NO: 625: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2549 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 625: 

GATGCCAATT AACCGTGCAT ATAATGTTGA GAAGTTAATC GAAGCAATTC AATATTATCA 60 

AGAAAAAACA AATCGTCGTG TTACTTTTGA ATATGGTCTG TTTGGTGGTG TGAATGACCA 120

ACTAGAACAT GCAAGAGAAT TAGCACATTT AwTAAAAGGC TTAAACTGCC ATGTTAACTT 180 

AATTCCtGTC AACCATGTTC CAGAAAGAAA TTATGTGAAA ACGGCTAAAA ATGATATCTT 240
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TTCGGATATT GACGCAGCTT GTGGTCAATT AAGAGCAAAG GAACGACAAG TAGAAACGAG 360 

GTAAAGACAA ATGCTAGAGG CACAATTTTT TACTGATACT GGACAACATA GAGATAAGAA 420 

TGAAGATGCG GGTGGTATTT TTTATAATCA AACTAATCAA CAACTTTTAG TTCTGTGTGA 480 

TGGTATGGGT GGCCATAAAG CAGGAGAAGT TGCAAGTAAA TTTGTTACAG ATGAGTTGAA 540

ATCyCGTTTT GAAGCGGAAA ATCTTATAGA ACAACATCAA GCTGAAAATT GGTTGCGTAA 600 

TAATATAAAA GATATAAATT TTCAGTTATA TCACTATGCA CAAGAAAATG GAGAATATAA 660 

AGGTATGGGT ACAACATGTG TTTGTGCACT TGTTTTTGAA AAATCAGTTG TGATAGCAAA 720 

TGTCGGTGAT TCTAGAGCCT ATGTTATTAA TAGTCGACAA ATTGAACAAA TTACTAGTGA 780 

TCACTCATTT GTTAATCATC TTGTTTTAAC GGGTCAAATT ACGCCGGAAG AAGCATTTAC 840

ACATCCACAA CGTAATATTA TTACGAAGGT GATGGGCACA GATAAACGTG TGAGTCCAGA 900 

TTTGTTTATT AAGCGATTAA ATTTTTATGA TTATTTATTA TTAAATTCAG ATGGATTAAC 960

TGATTATGTT AAAGACAATG AAATTAAGCG TTTGTTAGTA AAAGAAGGTA CAATAGAAGA 1020

TCATGGTGAT CAATTAATGC AATTGGCATT AGATAACCAT TCGAAAGATA ACGTTACTTT 1080 

CATACTCGCG GCTATTGAAG GTGATAAAGT ATGATAGGTA AAATAATAAA TGAACGATAT 1140

AAAATTGTAG ATAAGCTTGG CGGCGGTGGC ATGAGTACCG TTTATCTTGC TGAAGATACG 1200

ATACTTAACA TTAAAGTTGC AATTAAGGCG ATTTTTATAC CACCTAGAGA AAAAGAAGAA 1260 

ACATTAAAAC GTTTTGAACG AGAAGTACAT AACTCATCAC AGCTATCACA TCAAAATATA 1320 

GTAAGTATGA TCGATGTTGA TGAAGAAGAT GACTGTTACT ACTTAGTAAT GGAATATATC 1380

GAAGGTCCGA CTTTGTCTGA GTATATTGAA AGTCATGGGC CATTAAGTGT TGACACAGCG 1440

ATTAATTTTA CGAATCAAAT ATTGGATGGC ATTAAACATG CGCATGATAT GCGTATTGTA 1500 

CATAGAGATA TTAAGCCACA AAATATATTA ATTGACAGCA ATAAAACGTT GAAAATATTT 1560 

GATTTTGGAA TTGCTAAAGC TTTAAGTGAG ACGTCTTTAA CTCAGACTAA TCATGTGTTA 1620

GGTACTGTGC AGTACTTTTC GCCAGAACAA GCAAAAGGTG AGGCAACGGA TGAATGTACA 1680 

GATATTTATT CTATAGGTAT kGTGTTATAT GAAaTGCTTG TTGGTGAACC ACCCTTTAAT 1740

GGAGAAACTG CAGTTAGCAT TGCGATTAAA CATATTCAGG ATTCTGTGCC AAATGTGACA 1800 

ACAGATGTAC GTAAGGATAT TCCGCAATCT TTAAGTAATG TCATTTTACG CGCTACAGAA 1860

AAAGACnAAG CGAATCGTTA CAAAACAATT CAAGAAATGA AAGATGATTT GAGTAGTGTT 1920

TTACATGAAA ATCGAGCGAA TGAAGATGTC TATGAACTCG ATAAAATGAA AACGATAGCG 1980 

GTACCTTTGA AAAAAGAAGA TCTAGCAAAG CATATTAGTG AACATAAGTC GAATCAACCT 2040
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AGCCAGAAGG TACGGTGTAC GAACCAAAAC CTAAAAAGAA ATCAACACGA AAGATTGTGC 2160 

TCTTATCACT AATCTTTTCG TTGTTAATGA TTGCACTTGT TTCTTTTGTG GCAATGGCAA 2220 

TGTTTGGTAA TAAATACGAA GAGaCACCTG ATGTAATCGG GAAATCTGTA AAAGAAGCAG 2280 

AGCAAATATT CAATAAAAAC AACCTGAAAT TGGGTAAAAT TTCTAOAAGT TATAGTGATA 2340 

AATATCCTGA AAATGAAATT ATTAAGACAA CTCCTAATAC TGGTGAACGT GTTGAACGTG 2400

GTGACAGTGT TGATGTTGTT ATATCAAAGG GCCCTGAAAA GGTTAAAATO CCAAATGTCA 2460 

TTGGTTTACC TAAGGAGGAA GCCTTGCAGA AATTAAAATC GTTAGGTCTT AAAGATGTTA 2520 

CGATTGAAAA AGTATATAAT AATCAAGCG 2549
(2) INFORMATION FOR SEQ ID NO: 626: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2286 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 626:

TGCTTACTTC GCCTTCAATA CGTACTAATT CATGTCCACA ACTTGGACAA TGGGTTGGCA 60 

TATGATATGT GACAGCATCC TCAGGTCTAC GTTCTGGAAT ACTACGTACA ACTTCAGGTA 120 


TGATGTCACC TGCTTTTTTC ACTACAACAC TATCACCAAT TCGAATATCT CTGTCATGAA 180 

TTAAATCCTC ATTGTGCAAA GATGCTCTTG aTACAGTTGT ACCAGCTACT TTTACTGGTT 240 

CTAAAATAGC AGTAGGTGTG ACTACACCTG TTCGTCCAAT ACTTAATTCA ATATCTAATA 300

ATTTAGTTAC TACTTCCTCA GCTGGAAATT TATAAGCAAT GGCCCATCTA GGAGATTTTT 360 

GTGTGAATCC CATCTCATCC TGTTGATCTA AATCATTAAC CTTAATAACA ATCCCATCAA 420 

TATCATAAGG TAATGACTCT CTTTGGCTTG TCCATTTTTC AATATACTCT AAAACACCAT 480

CGATATTATT TACACGCGCT CTATTTTTAT TCGTTGTAAA ACCTAATTTA TCTAACTCAT 540 

CTAATGCTTC ACTTTGCGAA CGCGCATTGA AATCAGTGAA ATCATTGACA CTATATATAA 600 

ATACGCTTAG CTTTCGTTTT GCCGTTAATT TAGAATCTAA CTGTCTTAAT GATCCCGCAG 660

CAGCGTTTCT TGGATTnGCA AATAACTGCT CATCATTTTT TTCTTTTTCT TCATTTAATC 720 

GTAAAAATGA ACGTCTCGGC ATATATGCTT CACCACGAAC TTCTACATTT AATGGtTCTT 780 

TCATTTTCAA AGGTATCGCA TGAATTGTTT TTaAATTTTC GGTAATATCT TCACCTGTTG 840

TTCCATCACC ACGTGTTAAA CCTTGaACGA AGTATCCATC AACATATTTC AATGATACTG 900 
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GTTGOTCQAA TTTTCTCAAA 


TCATCCTCAT TAAATGCATT CCCTAAACTT AACATTGGCG 1020

TGTCATGGTT GACTTTATTG AAAGAGGCTT GGGCTTCACC GCCAACTCTA ACTGTTGGAG 1080

AATCTACAGT CTTATACTCA GGATGCTCCT CTTCTATTTT AATCAGTTCA TGAAGTAATT 1140

TGTCATATTC ACTATCTGGT ACAGATGGAT TATCCTCTAC ATAGTATTCA TAACTGTATT 1200

GATTTAATAA ATCATGTAAC TCGTTCACAC GAGACGATAA ATCAGCCATC CCTTAATCCT 1260

CCTTTTTTtC AATTGGTGCA AATTGCGCTA ACAAACGTTT TGGCCCTTGT GATTTAAAGA 1320

TAATATCTAG TTCGATTGAG CCATTTTTCT CGTTTACATT ACTCACCATG CCTTCTCCCC 1380

AGGCTTTATG CATCACTTTG TCACCTACAT TCCAATCAGA TGACAATACT TGTTTTTTCG 1440

TTGACGTTGT TCGTTGACTA AATCCGCGTT TAGGAAAAGG TTTTGCCTTA GGTTGTATCG 1500

TTTGTCGTTT GCCACTTGAA TGATTTTCTA ATAGTGATTC TGGAATTTCC TTTAAAAATC 1560

TGGATGGCAT ATTTGACTGA GGGCGACCAA ATAACATTCT TGATGTCGCA TGAGTGATAT 1620

ATAACACCTC TTCAGCCCTT GTAATTGCTA CATAACAAAT ACGAOGTTCT TCTTGCATTT 1680

CATGATCATC TTCACTCTTA ATCGCTCTAA TATGTGGGAA TAAAGATTCT TCCATCCCCA 1740

TTATAAAGAC AATTGGAAAT TCAAGGCCCT TAGCCGAGTG CATCGTCATT AGTGTTACGC 1800

CATTTTCAGT ATCTGCCTCA TCAATATCAG CTACTAACGA TAAATCCGTT AAAAAGTTAA 1860

TTAATGACTG TTCTTCTAAT GGGGTATTTT CCTCATAGTC TTTTGGTACT GACATAAATT 1920

CATCGATGTT TTCTAATCTA CTTCGAGATT CTAATGTATT TTCACGTTCA AGCATTTCTC 1980

GATAGCCAGA CTTTTGTAAT ACTTCATCAA CTATTTCATG AATTTCTAAA AATTCTTGTT 2040

CTTTTATCAA GCTTTGGATT AACTCGTAAA AATTAAGACA CTCTTGTGTC ACCTTTTTTG 2100

aCAATCCGAT AAAATCAGCT TCTCCAAGTG CATCAAACAT ACTGATATTG TTTTGAAGTG 2160

CATAGTTTTG AACTTTTTCA ACAGATGAAG GACCTACACC TCTTTTTGGA ACATTaATAA 2220

TACGTTGCAA ACTAATGTCA TCATTACTAT TGGCAATTAT ACG£AAATAA CTTAATAAAT 2280

CTTTGA 2286




(2) INFORMATION FOR SEQ ID NO: 627:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 627:
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TGACATTAAA AGTTGGGTGA AnAAACAAGG CATACTTTAG TTAAGCTTGA TGAAAATAAC 120 

AATGGAATTA ATGCGATTAT TCAAAAAGAA AAAGCAAAAG ATTTAGATAT AAATTATTCT 180 

GCTAAAGGTA CTACCAATTG TATTATTTAG TGGAGAATTA GACAAGCTGT AGCAGCGTTG 240

ATTATTGCCA ATGGTGCTAG AGCTGCTGGA AAAGATGTAA CTACCTCCTT ACTTTTTGGG 300 

GGCTTATGCC nTTAAAAAAG TGCCACCGTT AATGTTAAAA GCAGTTGCCA AAATGTTTGA 360 


TTnATGTTGC CCCAAAAGAT TTACGATGCC CCTTCCCAAA 400
(2) INFORMATION FOR SEQ ID NO: 628:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 453 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 628:
AATAATTTGG GCGCTnTTTG CGTCGGGATA TTATACCGCT TCCTTAATTG TTCAACATTG 60 
TAATCACTGT TTTTCAATTG ATATTTTGCA GAGTAAATTG GTACTTCTGG GTTATATGAC 120
ACTTCGTCCT C1TTATAGTT TTCCAATTCT TTGAAATTCC CGTATTGTAC AAAGAAGTTA 180 
AATTCTTCGA TTTCTTTTTT TACTTTTTCG TCATCGATTG GTTTTAATGG AATCATTTTA 240 


TTAkTTTCCA TTTTCACAGG ATATCTTTTT GTATGATTGT GTGTCATTCC ATCGCTATCT 300 

TCAACAACTT CTCTAACAAT ATAATGCCCT TTAGCCGTTC TAGTATTTCT GTTAATTTCT 360 

AAAACTGCTC CTCTrGATTC CAGATTTTCT CCTTTTAATT GGATTTTCAT TTCAGATCTA 420

ATTAGCCAAG TACCTTTATC ATCTtTTTTA AAT 453

(2) INFORMATION FOR SEQ ID NO: 629:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1221 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 629:
TCATATGCAT TTGCAAAATA AACGCCAGAA GCAAGGTTTA GAATTGGGCC GTCCGTTTTG 60

CTCAaTTCAC TTGcATTCAA TAATTCATGC TGATCATGAT CAATTTGCTT ATCTAATTCT 120 
GCaATTTTCT TCATTTGCTT ATCTGATTTG TTTTCTTTCG CCATCATTTG ATCACGACGA 180 
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5 






GCTAATGOTA CTAATACTTT ATCTCCATAT GTGTCAATAA AGTTATAAAA ATAATCATCT 300

GTTTTTGATA CAAATCCAGC ACGCTCTTCA GTTTCACGAT ATAAATCTAA GAAAAGATTG 360

AACTCATCAC GTTCAAGGAA TCTGACTTTA ACACCATAGT TTATCGCTTT ATTAATATTA 420

CGTTTACGTT GACTATCAAA TGTCTTTTTC AATGTTTCGG GTGTTTTACC TTCAAGGTTT 480

AATACGCCCA TCCATCGTAC TTGGCTCGAT GTATCATACT CAGTTGTAAA GCCATGATGC 540

TCGTAACCAT GTGATTTAAA CAAGTTTACT AGGGCATCAT TTTTCTOGCG ACCTTCAAAT 600

GGCACGATAT CTTTATCATA TAGATGATAT AACCAATACG GATCTAATTT AACATATAAA 660

CATTGATGTT GCTGTAAATA TTTATCTAAC TCTTTTAAAT AATAATGAAC TAATCCTAAA 720

TCTGAAAAAT CCATTACTGG ACCACGATTC GAATAGTAAA CATAACTTCC CATAGTAGGA 780

ATTTTAGAGA AAAGGCTTGC TGCAATTACT TTGTTATTGT CGTCTTTAAT ACCTAATAAA 840

ACTACTTCAA AGCCATCATT CTCACGGGTA V» X X X iAVr X lO GAAATAATGA 900

CTTTCCAATG ATGGATTTTG TAGAAAGTTG TCAAATTCGG TAACAGTTAA CTCTGTAAAT 960

TTCATGTTTT GATAATTCCt TCCTAAAAAA TTCTGTCTTT AAcrrmTA AGTGCGGTAT 1020

ATGCTGCGTA AACAGGTTTA TTAATTGGTT TAATAAAGTC ACCAACATAT TCmATAATTT 1080

CAGCATTCTA accttttttg AATTTAACTA CACCAGCATC TTCAGCATCT TCTGTAAATT 1140

TrCCACTAAC ACCATAGAAA TTATAACGGT CAATGCATGA TTTAATGCAT AATTAATCAT 1200

TTCCCATTGC ACTGCATAAC T
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(2) INFORMATION FOR SEQ ID NO: 630: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1121 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 630:
TGGCCCAAnT AACACAAGTA ATTGCTGCTA ATGGTAACAT CACAAAAAAT GAAATCGTAA 60 


CTACAAATGT TAAACCTTGG AATACACCAA CCATTTCTGG TAATCGTTTA CTATAGTATC 120 
TATTGTGAAT CCAAGTAATA ATAGCTGAAA TAATAATACC ACCTAGAATA TTCGTATCCA 180 
ATGTGGCAAT ACCTGCAATT GATTTTAAAC CAGGTACATT TTCAACGCCT TTTTCTAAAT 240 


TAGCGCCAAA CGTATGTGGC CATTGTGTTA AAATGGCATT TATAAATGTA TTAAACATTA 300 
AGTAACCCAT CAATGCTGCA AGTGCTGCAT GACCTGGTGC TTTTTTAGCT AAAGAAAGTG 360 

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